GCTGGGTACC GGGCCCGTCG ACGCGTATAT AGTTCCTCCT TTCAGCAAAA AACCCCTCAA 60


GACCCGTTTA GAGGCCCCAA GGGGTTATGC TACTGCAGGC TCTCCCTTAG CCATCCGAGT 120
GGACGTGCGT CCTCCTTCGG ATGCCCAGGT CGGACCGCGA GGAGGTGGAG ATGCCATGCC 180
GACCCACCAA ACAAGAGAAG AAACTTGTCT GGGAATATAA ATTTAACTTT AAATTAACTT 240
AGGATTAAAG ACATTGACTA GAAGGTCAAG AAAAGGGAAC TCTATAATTT CAAAAATGTT 300
GAGCCTATTT GATACATTTA ATGCACGTAG GCAAGAAAAC ATAACAAAAT CAGCCGGTGG 360
AGCTATCATT CCTGGACAGA AAAATACTGT CTCTATATTC GCCCTTGGAC CGACAATAAC 420
TGATGATAAT GAGAAAATGA CATTAGCTCT TCTATTTCTA TCTCATTCAC TAGATAATGA 480
GAAACAACAT GCACAAAGGG CAGGGTTCTT GGTGTCTTTA TTGTCAATGG CTTATGCCAA 540
TCCAGAGCTC TACCTAACAA CAAATGGAAG TAATGCAGAT GTCAAGTATG TCATATACAT 600
GATTGAGAAA GATCTAAAAC GGCAAAAGTA TGGAGGATTT GTGGTTAAGA CGAGAGAGAT 660
GATATATGAA AAGACAACTG ATTGGATATT TGGAAGTGAC CTGGATTATG ATCAGGAAAC 720
TATGTTGCAG AACGGCAGGA ACAATTCAAC AATTGAAGAC CTTGTCCACA CATTTGGGTA 780
TCCATCATGT TTAGGAGCTC TTATAATACA GATCTGGATA GTTCTGGTCA AAGCTATCAC 840
TAGTATCTCA GGGTTAAGAA AAGGCTTTTT CACCCGATTG GAAGCTTTCA GACAAGATGG 900
AACAGTGCAG GCAGGGCTGG TATTGAGCGG TGACACAGTG GATCAGATTG GGTCAATCAT 960
GCGGTCTCAA CAGAGCTTGG TAACTCTTAT GGTTGAAACA TTAATAACAA TGAATACCAG 1020
CAGAAATGAC CTCACAACCA TAGAAAAGAA TATACAAATT GTTGGCAACT ACATAAGAGA 1080
TGCAGGTCTC GCTTCATTCT TCAATACAAT CAGATATGGA ATTGAGACCA GAATGGCAGC 1140
TTTGACTCTA TCCACTCTCA GACCAGATAT CAATAGATTA AAAGCTTTGA TGGAACTGTA 1200
TTTATCAAAG GGACCACGCG CTCCTTTCAT CTGTATCCTC AGAGATCCTA TACATGGTGA 1260
GTTCGCACCA GGCAACTATC CTGCCATATG GAGCTATGCA ATGGGGGTGG CAGTTGTACA 1320
AAATAGAGCC ATGCAACAGT ATGTGACGGG AAGATCATAT CTAGACATTG ATATGTTCCA 1380
GCTAGGACAA GCAGTAGCAC GTGATGCCGA AGCTCAAATG AGCTCAACAC TGGAAGATGA 1440
ACTTGGAGTG ACACACGAAT CTAAAGAAAG CTTGAAGAGA CATATAAGGA ACATAAACAG 1500
TTCAGAGACA TCTTTCCACA AACCGACAGG TGGATCAGCC ATAGAGATGG CAATAGATGA 1560
AGAGCCAGAA CAATTCGAAC ATAGAGCAGA TCAAGAACAA AATGGAGAAC CTCAATCATC 1620
CATAATTCAA TATGCCTGGG CAGAAGGAAA TAGAAGCGAT GATCAGACTG AGCAAGCTAC 1680
AGAATCTGAC AATATCAAGA CCGAACAACA AAACATCAGA GACAGACTAA ACAAGAGACT 1740
CAACGACAAG AAGAAACAAA GCAGTCAACC ACCCACTAAT CCCACAAACA GAACAAACCA 1800
GGACGAAATA GATGATCTGT TTAACGCATT TGGAAGCAAC TAATCGAATC AACATTTTAA 1860
TCTAAATCAA TAATAAATAA GAAAAACTTA GGATTAAAGA ATCCTATCAT ACCGGAATAT 1920
AGGGTGGTAA ATTTAGAGTC TGCTTGAAAC TCAATCAATA GAGAGTTGAT GGAAAGCGAT 1980
GCTAAAAACT ATCAAATCAT GGATTCTTGG GAAGAGGAAT CAAGAGATAA ATCAACTAAT 2040
ATCTCCTCGG CCCTCAACAT CATTGAATTC ATACTCAGCA CCGACCCCCA AGAAGACTTA 2100
TCGGAAAACG ACACAATCAA CACAAGAACC CAGCAACTCA GTGCCACCAT CTGTCAACCA 2160
GAAATCAAAC CAACAGAAAC AAGTGAGAAA GATAGTGGAT CAACTGACAA AAATAGACAG 2220
TCTGGGTCAT CACACGAATG TACAACAGAA GCAAAAGATA GAAATATTGA TCAGGAAACT 2280
GTACAGAGAG GACCTGGGAG AAGAAGCAGC TCAGATAGTA GAGCTGAGAC TGTGGTCTCT 2340
GGAGGAATCC CCAGAAGCAT CACAGATTCT AAAAATGGAA CCCAAAACAC GGAGGATATT 2400
GATCTCAATG AAATTAGAAA GATGGATAAG GACTCTATTG AGGGGAAAAT GCGACAATCT 2460
GCAAATGTTC CAAGCGAGAT ATCAGGAAGT GATGACATAT TTACAACAGA ACAAAGTAGA 2520
AACAGTGATC ATGGAAGAAG CCTGGAATCT ATCAGTACAC CTGATACAAG ATCAATAAGT 2580
GTTGTTACTG CTGCAACACC AGATGATGAA GAAGAAATAC TAATGAAAAA TAGTAGGACA 2640
AAGAAAAGTT CTTCAACACA TCAAGAAGAT GACAAAAGAA TTAAAAAAGG GGGAAAAGGG 2700
AAAGACTGGT TTAAGAAATC AAAAGATACC GACAACCAGA TACCAACATC AGACTACAGA 2760
TCCACATCAA AAGGGCAGAA GAAAATCTCA AAGACAACAA CCACCAACAC CGACACAAAG 2820
GGGCAAACAG AAATACAGAC AGAATCATCA GAAACACAAT CCTCATCATG GAATCTCATC 2880
ATCGACAACA ACACCGACCG GAACGAACAG ACAAGCACAA CTCCTCCAAC AACAACTTCC 2940
AGATCAACTT ATACAAAAGA ATCGATCCGA ACAAACTCTG AATCCAAACC CAAGACACAA 3000
AAGACAAATG GAAAGGAAAG GAAGGATACA GAAGAGAGCA ATCGATTTAC AGAGAGGGCA 3060
ATTACTCTAT TGCAGAATCT TGGTGTAATT CAATCCACAT CAAAACTAGA TTTATATCAA 3120
GACAAACGAG TTGTATGTGT AGCAAATGTA CTAAACAATG TAGATACTGC ATCAAAGATA 3180
GATTTCCTGG CAGGATTAGT CATAGGGGTT TCAATGGACA ACGACACAAA ATTAACACAG 3240
ATACAAAATG AAATGCTAAA CCTCAAAGCA GATCTAAAGA AAATGGACGA ATCACATAGA 3300
AGATTGATAG AAAATCAAAG AGAACAACTG TCATTGATCA CGTCACTAAT TTCAAATCTC 3360
AAAATTATGA CTGAGAGAGG AGGAAAGAAA GACCAAAATG AATCCAATGA GAGAGTATCC 3420
ATGATCAAAA CAAAATTGAA AGAAGAAAAG ATCAAGAAGA CCAGGTTTGA CCCACTTATG 3480
GAGGCACAAG GCATTGACAA GAATATACCC GATCTATATC GACATGCAGG AGATACACTA 3540
GAGAACGATG TACAAGTTAA ATCAGAGATA TTAAGTTCAT ACAATGAGTC AAATGCAACA 3600
AGACTAATAC CCAAAAAAGT GAGCAGTACA ATGAGATCAC TAGTTGCAGT CATCAACAAC 3660
AGCAATCTCT CACAAAGCAC AAAACAATCA TACATAAACG AACTCAAACG TTGCAAAAAT 3720
GATGAAGAAG TATCTGAATT AATGGACATG TTCAATGAAG ATGTCAACAA TTGCCAATGA 3780
TCCAACAAAG AAACGACACC GAACAAACAG ACAAGAAACA ACAGTAGATC AAAACCTGTC 3840
AACACACACA AAATCAAGCA GAATGAAACA ACAGATATCA ATCAATATAC AAATAAGAAA 3900
AACTTAGGAT TAAAGAATAA ATTAATCCTT GTCCAAAATG AGTATAACTA ACTCTGCAAT 3960
ATACACATTC CCAGAATCAT CATTCTCTGA AAATGGTCAT ATAGAACCAT TACCACTCAA 4020
AGTCAATGAA CAGAGGAAAG CAGTACCCCA CATTAGAGTT GCCAAGATCG GAAATCCACC 4080
AAAACACGGA TCCCGGTATT TAGATGTCTT CTTACTCGGC TTCTTCGAGA TGGAACGAAT 4140
CAAAGACAAA TACGGGAGTG TGAATGATCT CGACAGTGAC CCGAGTTACA AAGTTTGTGG 4200
CTCTGGATCA TTACCAATCG GATTGGCTAA GTACACTGGG AATGACCAGG AATTGTTACA 4260
AGCCGCAACC AAACTGGATA TAGAAGTGAG AAGAACAGTC AAAGCGAAAG AGATGGTTGT 4320
TTACACGGTA CAAAATATAA AACCAGAACT GTACCCATGG TCCAATAGAC TAAGAAAAGG 4380
AATGCTGTTC GATGCCAACA AAGTTGCTCT TGCTCCTCAA TGTCTTCCAC TAGATAGGAG 4440
CATAAAATTT AGAGTAATCT TCGTGAATTG TACGGCAATT GGATCAATAA CCTTGTTCAA 4500
AATTCCTAAG TCAATGGCAT CACTATCTCT ACCCAACACA ATATCAATCA ATCTGCAGGT 4560
ACACATAAAA ACAGGGGTTC AGACTGATTC TAAAGGGATA GTTCAAATTT TGGATGAGAA 4620
AGGCGAAAAA TCACTGAATT TCATGGTCCA TCTCGGATTG ATCAAAAGAA AAGTAGGCAG 4680
AATGTACTCT GTTGAATACT GTAAACAGAA AATCGAGAAA ATGAGATTGA TATTTTCTTT 4740
AGGACTAGTT GGAGGAATCA GTCTTCATGT CAATGCAACT GGGTCCATAT CAAAAACACT 4800
AGCAAGTCAG CTGGTATTCA AAAGAGAGAT TTGTTATCCT TTAATGGATC TAAATCCGCA 4860
TCTCAATCTA GTTATCTGGG CTTCATCAGT AGAGATTACA AGAGTGGATG CAATTTTCCA 4920
ACCTTCTTTA CCTGGCGAGT TCAGATACTA TCCTAATATT ATTGCAAAAG GAGTTGGGAA 4980
AATCAAACAA TGGAACTAGT AATCTCTATT TTAGTCCGGA CGTATCTATT AAGCCGAAGC 5040
AAATAAAGGA TAATCAAAAA CTTAGGACAA AAGAGGTCAA TACCAACAAC TATTAGCAGT 5100
CACACTCGCA AGAATAAGAG AGAAGGGACC AAAAAAGTCA AATAGGAGAA ATCAAAACAA 5160
AAGGTACAGA ACACCAGAAC AACAAAATCA AAACATCCAA CTCACTCAAA ACAAAAATTC 5220
CAAAAGAGAC CGGCAACACA ACAAGCACTG AACACAATGC CAACTTCAAT ACTGCTAATT 5280
ATTACAACCA TGATCATGGC ATCTTTCTGC CAAATAGATA TCACAAAACT ACAGCACGTA 5340
GGTGTATTGG TCAACAGTCC CAAAGGGATG AAGATATCAC AAAACTTTGA AACAAGATAT 5400
CTAATTTTGA GCCTCATACC AAAAATAGAA GACTCTAACT CTTGTGGTGA CCAACAGATC 5460
AAGCAATACA AGAAGTTATT GGATAGACTG ATCATCCCTT TATATGATGG ATTAAGATTA 5520
CAGAAAGATG TGATAGTAAC CAATCAAGAA TCCAATGAAA ACACTGATCC CAGAACAAAA 5580
CGATTCTTTG GAGGGGTAAT TGGAACCATT GCTCTGGGAG TAGCAACCTC AGCACAAATT 5640
ACAGCGGCAG TTGCTCTGGT TGAAGCCAAG CAGGCAAGAT CAGACATCGA AAAACTCAAA 5700
GAAGCAATTA GGGACACAAA CAAAGCAGTG CAGTCAGTTC AGAGCTCCAT AGGAAATTTA 5760
ATAGTAGCAA TTAAATCAGT CCAGGATTAT GTTAACAAAG AAATCGTGCC ATCGATTGCG 5820
AGGCTAGGTT GTGAAGCAGC AGGACTTCAA TTAGGAATTG CATTAACACA GCATTACTCA 5880
GAATTAACAA ACATATTTGG TGATAACATA GGATCGTTAC AAGAAAAAGG AATAAAATTA 5940
CAAGGTATAG CATCATTATA CCGCACAAAT ATCACAGAAA TATTCACAAC ATCAACAGTT 6000
GATAAATATG ATATCTATGA TCTGTTATTT ACAGAATCAA TAAAGGTGAG AGTTATAGAT 6060
GTTGACTTGA ATGATTACTC AATCACCCTC CAAGTCAGAC TCCCTTTATT AACTAGGCTG 6120
CTGAACACTC AGATCTACAA AGTAGATTCC ATATCATATA ACATCCAAAA CAGAGAATGG 6180
TATATCCCTC TTCCCAGCCA TATCATGACG AAAGGGGCAT TTCTAGGTGG AGCAGACGTC 6240
AAAGAATGTA TAGAAGCATT CAGCAGCTAT ATATGCCCTT CTGATCCAGG ATTTGTATTA 6300
AACCATGAAA TAGAGAGCTG CTTATCAGGA AACATATCCC AATGTCCAAG AACAACGGTC 6360
TTGTTCCAAG



ACATCAGACA 

ACAACCACCT 

GTAAAAATTA 

ACAAATAAAG 

GTTGCACTTG 

GAATCAAAAG 

CAATCTAGCA 

ATAACGATAA 

CAAAATGACA 

AAAATTATAA 

AAGGACCCAA 

AAGGATGCTG 

AAGATAATAT 

GTGCTAATTA 

AATGAGTTTA 

ATACAGTCAG 

CCAATATCAT 

AGAAATGATA 

TTAAATCCAG 

AAAATAAGGT 

GTTAGAACTC 

ACTCGAGGTT 

GTAAACTCAG 

GACAATAGGA 

ACTCCCAAAG

GATATTGTCA 
TTTGATCAAC 
AAAATAATAT 
AACACAACTG 
TGGTTTTCAG 
ATTCCAAAAT 
AGGTTACTTC 
AAGTTACAAT 
TGGCATAATG
GATGGATGTA 


GTACATGCAA 
TAACACATAA 
AAGGAACTCT 
ATCCAATTGA 
AATGGATAAG 
CTACAATCAT 
TTACAATTGC 
AGCCATATGT 
AAAACTTAGG 
TAGACAAATC 
GTAATGAGCT 
ACATATTATG 
ATTCCATCAA 
TGGAAATTAC 
GAGTGAATAC 
TGACACAACA 
ATCAAGAAGT
ATGATTTTTG 
TAATGCCAGG 
CGTCTTTAGT 
GTCAGGATAT 
ACTTGGTACC 
AGTCATGTTC 
TTGATGAAAG 
ATTATGATGG 
CATATGCTGC 
TTCTCGGGTA 
GGTGCCCCGG 
ATAGGAGGAT 
TGAAAGTATG 
TACTAGGTAA 
TAGGAATAAT 
TGCTATCAAG 
TAACAGGAGT 


ATATGCATTT 
CGGAATTGGT 
AGAATGTAGT 
TGCATTCTAT 
CATATCAATC 
AAGGTCAAAT 
AATTATTTTG 
AATTAAGTAT 
ACTAACAAAC 
AGTAAAGTTA 
CAAATTCGAG 
GGAGACGTCT 
GACAATAATC 
AAGTGAAAAG 
AGAAAAGATC 
AAGGCTTCTT 
GATGTCAGAT 
GCTGCCACAA 
GAGATGCACG 
GCCGGGATTA 
TATAAATGAT 
AGGAAAATCA 
TGACTTAAAT 
TCTAGCACTC 
ATCAGATTAT 
TTCAATCTCA 
ACTATACCCA
TGGAGGTCTT 
GAAAACACAG 
GGTCAACTCC 
GACGATATCT 
CAAGATCTAT 
TGATATTACT 
ACCAGGAAAC 
ATATACTGAT 


GTCAATGGAG 
AATAGAATCA 
ACAATAGGTA 
ACACCAAATG
GAGCTCAACA 
CAAAAACTAG 
ATAATGATCA 
TACAGAATTC 
AAATAACATA 
CGCAATCCAA 
ATGGAATACT 
ATGGCTACTC 
CTGGTGTTAT 
GCCCACGAAT 
CAAATGGCAT 
ACAATTCAGA 
CTTAGGAAAT 
AGAATAACAC 
TCTGGTCTTC 
TTAGCTATGC
CTGATTTATG 
TATCAAGTCT 
CCTAGGATCT
CTAAATATAG 
GCATCATCAG 
ACAACAAGAT 
TCTGTTGGAC 
GAACATCCAA 
AGAGACTGTA
ATCATTGTTG 
ATGCGACAAA 
ATATATACAA 
GATTACAGTG 
AATGAATGTC 
GCATATCCAC 


GAGTGGTTGC 

ATCAACCACC 

TCAACGGAAT 

ATATAACACT 

AGGCCAAATC 

ATTCTATTGG 

TTATATTGTT 

AAAAGAGAAA 

TCTACAGATC 

CTCTACTCAT 

GGAAGCATAC 

ATGGCAACAA 

TATCAATAGT 

CATTGCTGCA 

CGGATAATAC 

GTCATGTCCA 

TCATTAGTGA 

ATGATGTAGG 

CATCTTTAAT 

CAACGACTGT 

CTTATACCTC 

TACAGATAGG 

CTCATACCTT 

ATGTATATCA 

GCATAGAAGA 

TTAAGAATAA 

CAGGGATATA 

TAAATGAGAA 

ATCAAGCATC 

TTGACAAAGG 

ATTACTGGGG 

GATCTACAAG 

ATATAAGGAT 

CATGGGGACA 

TCAATCCCAC 


AAACTGTATA 

TGATCAAGGA 

GCTGTTCAAT 

AAACAATTCT 

AGATCTAGAA 

AAATTGGCAT 

TATAATTAAT 

TCGAGTGGAT 

ATTAGATATT 

ATAATTGAGG 

CAATCACGGA 

GCTCACTAAT 

CTTCATCATA 

AGACATAAAT 

CAATGATCTA 

GAATTACATA 

AATTACAATT 

TATAAAACCT 

GAAAACTCCA 

TGATGGCTGT 

AAATCTAATT 

GATAATAACT 

TAACATAAAT 

ACTGTGTTCA 

TATTGTACTT 

TAACATAAGC 

CTACAAAGGC 

TGTAATCTGC 

TCATAGTACT 

CTTAAACTCA 

GTCAGAAGGA 

TTGGCATAGC 

AAAATGGACA 

TTCATGTCCA 

AGGGAGCATT 


6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 


8460 
GTGTCATCTG TCATATTAGA CTCACAAAAA TCGAGAGTGA ACCCAGTCAT AACTTACTCA 8520
ACAGCAACCG AAAGAGTAAA CGAGCTGGCC ATCCTAAACA GAACACTCTC AGCTGGATAT 8580
ACAACAACAA GCTGCATTAC ACACTATAAC AAAGGATATT GTTTTCATAT AGTAGAAATA 8640
AATCATAAAA GCTTAAACAC ATTTCAACCC ATGTTGTTCA AAACAGAGAT TCCAAAAAGC 8700
TGCAGTTAAT CATAATTAAC CATAATATGC ATCAATCTAT CTATAATACA AGTATATGAT 8760
AAGTAATCAG CAATCAGACA ATAGACAAAA GGGAAATATA AAAAACTTAG GAGCAAAGCG 8820
TGCTCGGGAA ATGGACACTG AATCTAACAA TGGCACTGTA TCTGACATAC TCTATCCTGA 8880
GTGTCACCTT AACTCTCCTA TCGTTAAAGG TAAAATAGCA CAATTACACA CTATTATGAG 8940
TCTACCTCAG CCTTATGATA TGGATGACGA CTCAATACTA GTTATCACTA GACAGAAAAT 9000
AAAACTTAAT AAATTGGATA AAAGACAACG ATCTATTAGA AGATTAAAAT TAATATTAAC 9060
TGAAAAAGTG AATGACTTAG GAAAATACAC ATTTATCAGA TATCCAGAAA TGTCAAAAGA 9120
AATGTTCAAA TTATATATAC CTGGTATTAA CAGTAAAGTG ACTGAATTAT TACTTAAAGC 9180
AGATAGAACA TATAGTCAAA TGACTGATGG ATTAAGAGAT CTATGGATTA ATGTGCTATC 9240
AAAATTAGCC TCAAAAAATG ATGGAAGCAA TTATGATCTT AATGAAGAAA TTAATAATAT 9300
ATCGAAAGTT CACACAACCT ATAAATCAGA TAAATGGTAT AATCCATTCA AAACATGGTT 9360
TACTATCAAG TATGATATGA GAAGATTACA AAAAGCTCGA AATGAGATCA CTTTTAATGT 9420
TGGGAAGGAT TATAACTTGT TAGAAGACCA GAAGAATTTC TTATTGATAC ATCCAGAATT 9480
GGTTTTGATA TTAGATAAAC AAAACTATAA TGGTTATCTA ATTACTCCTG AATTAGTATT 9540
GATGTATTGT GACGTAGTCG AAGGCCGATG GAATATAAGT GCATGTGCTA AGTTAGATCC 9600
AAAATTACAA TCTATGTATC AGAAAGGTAA TAACCTGTGG GAAGTGATAG ATAAATTGTT 9660
TCCAATTATG GGAGAAAAGA CATTTGATGT GATATCGTTA TTAGAACCAC TTGCATTATC 9720
CTTAATTCAA ACTCATGATC CTGTTAAACA ACTAAGAGGA GCTTTTTTAA ATCATGTGTT 9780
ATCCGAGATG GAATTAATAT TTGAATCTAG AGAATCGATT AAGGAATTTC TGAGTGTAGA 9840
TTACATTGAT AAAATTTTAG ATATATTTAA TAAGTCTACA ATAGATGAAA TAGCAGAGAT 9900
TTTCTCTTTT TTTAGAACAT TTGGGCATCC TCCATTAGAA GCTAGTATTG CAGCAGAAAA 9960
GGTTAGAAAA TATATGTATA TTGGAAAACA ATTAAAATTT GACACTATTA ATAAATGTCA 10020
TGCTATCTTC TGTACAATAA TAATTAACGG ATATAGAGAG AGGCATGGTG GACAGTGGCC 10080
TCCTGTGACA TTACCTGATC ATGCACACGA ATTCATCATA AATGCTTACG GTTCAAACTC 10140
TGCGATATCA TATGAAAATG CTGTTGATTA TTACCAGAGC TTTATAGGAA TAAAATTCAA 10200
TAAATTCATA GAGCCTCAGT TAGATGAGGA TTTGACAATT TATATGAAAG ATAAAGCATT 10260
ATCTCCAAAA AAATCAAATT GGGACACAGT TTATCCTGCA TCTAATTTAC TGTACCGTAC 10320
TAACGCATCC AACGAATCAC GAAGATTAGT TGAAGTATTT ATAGCAGATA GTAAATTTGA 10380
TCCTCATCAG ATATTGGATT ATGTAGAATC TGGGGACTGG TTAGATGATC CAGAATTTAA 10440
TATTTCTTAT AGTCTTAAAG AAAAAGAGAT CAAACAGGAA GGTAGACTCT TTGCAAAAAT 10500
GACATACAAA ATGAGAGCTA CACAAGTTTT ATCAGAGACC CTACTTGCAA ATAACATAGG 10560
AAAATTCTTT CAAGAAAATG GGATGGTGAA GGGAGAGATT GAATTACTTA AGAGATTAAC 10620
AACCATATCA ATATCAGGAG TTCCACGGTA TAATGAAGTG TACAATAATT CTAAAAGCCA 10680
TACAGATGAC CTTAAAACCT ACAATAAAAT AAGTAATCTT AATTTGTCTT CTAATCAGAA 10740
ATCAAAGAAA TTTGAATTCA AGTCAACGGA TATCTACAAT GATGGATACG AGACTGTGAG 10800
CTGTTTCCTA ACAACAGATC TCAAAAAATA CTGTCTTAAT TGGAGATATG AATCAACAGC 10860
TCTATTTGGA GAAACTTGCA ACCAAATATT TGGATTAAAT AAATTGTTTA ATTGGTTACA 10920
CCCTCGTCTT GAAGGAAGTA CAATCTATGT AGGTGATCCT TACTGTCCTC CATCAGATAA 10980
AGAACATATA TCATTAGAGG ATCACCCTGA TTCTGGTTTT TACGTTCATA ACCCAAGAGG 11040
GGGTATAGAA GGATTTTGTC AAAAATTATG GACACTCATA TCTATAAGTG CAATACATCT 11100
AGCAGCTGTT AGAATAGGCG TGAGGGTGAC TGCAATGGTT CAAGGAGACA ATCAAGCTAT 11160
AGCTGTAACC ACAAGAGTAC CCAACAATTA TGACTACAGA GTTAAGAAGG AGATAGTTTA 11220
TAAAGATGTA GTGAGATTTT TTGATTCATT AAGAGAAGTG ATGGATGATC TAGGTCATGA 11280
ACTTAAATTA AATGAAACGA TTATAAGTAG CAAGATGTTC ATATATAGCA AAAGAATCTA 11340
TTATGATGGG AGAATTCTTC CTCAAGCTCT AAAAGCATTA TCTAGATGTG TCTTCTGGTC 11400
AGAGACAGTA ATAGACGAAA CAAGATCAGC ATCTTCAAAT TTGGCAACAT CATTTGCAAA 11460
AGCAATTGAG AATGGTTATT CACCTGTTCT AGGATATGCA TGCTCAATTT TTAAGAATAT 11520
TCAACAACTA TATATTGCCC TTGGGATGAA TATCAATCCA ACTATAACAC AGAATATCAG 11580
AGATCAGTAT TTTAGGAATC CAAATTGGAT GCAATATGCC TCTTTAATAC CTGCTAGTGT 11640
TGGGGGATTC AATTACATGG CCATGTCAAG ATGTTTTGTA AGGAATATTG GTGATCCATC 11700
AGTTGCCGCA TTGGCTGATA TTAAAAGATT TATTAAGGCG AATCTATTAG ACCGAAGTGT 11760
TCTTTATAGG ATTATGAATC AAGAACCAGG TGAGTCATCT TTTTTGGACT GGGCTTCAGA 11820
TCCATATTCA TGCAATTTAC CACAATCTCA AAATATAACC ACCATGATAA AAAATATAAC 11880
AGCAAGGAAT GTATTACAAG ATTCACCAAA TCCATTATTA TCTGGATTAT TCACAAATAC 11940
AATGATAGAA GAAGATGAAG AATTAGCTGA GTTCCTGATG GACAGGAAGG TAATTCTCCC 12000
TAGAGTTGCA CATGATATTC TAGATAATTC TCTCACAGGA ATTAGAAATG CCATAGCTGG 12060
AATGTTAGAT ACGACAAAAT CACTAATTCG GGTTGGCATA AATAGAGGAG GACTGACATA 12120
TAGTTTGTTG AGGAAAATCA GTAATTACGA TCTAGTACAA TATGAAACAC TAAGTAGGAC 12180
TTTGCGACTA ATTGTAAGTG ATAAAATCAA GTATGAAGAT ATGTGTTCGG TAGACCTTGC 12240
CATAGCATTG CGACAAAAGA TGTGGATTCA TTTATCAGGA GGAAGGATGA TAAGTGGACT 12300
TGAAACGCCT GACCCATTAG AATTACTATC TGGGGTAGTA ATAACAGGAT CAGAACATTG 12360
TAAAATATGT TATTCTTCAG ATGGCACAAA CCCATATACT TGGATGTATT TACCCGGTAA 12420
TATCAAAATA GGATCAGCAG AAACAGGTAT ATCGTCATTA AGAGTTCCTT ATTTTGGATC 12480
AGTCACTGAT GAAAGATCTG AAGCACAATT AGGATATATC AAGAATCTTA GTAAACCTGC 12540
AAAAGCCGCA ATAAGAATAG CAATGATATA TACATGGGCA TTTGGTAATG ATGAGATATC 12600
TTGGATGGAA GCCTCACAGA TAGCACAAAC ACGTGCAAAT TTTACACTAG ATAGTCTCAA 12660





AATTTTAACA CCGGTAGCTA CATCAACAAA TTTATCACAC AGATTAAAGG ATACTGCAAC 12720
TCAGATGAAA TTCTCCAGTA CATCATTGAT CAGAGTCAGC AGATTCATAA CAATGTCCAA 12780
TGATAACATG TCTATCAAAG AAGCTAATGA AACCAAAGAT ACTAATCTTA TTTATCAACA 12840
AATAATGTTA ACAGGATTAA GTGTTTTCGA ATATTTATTT AGATTAAAAG AAACCACAGG 12900
ACACAACCCT ATAGTTATGC ATCTGCACAT AGAAGATGAG TGTTGTATTA AAGAAAGTTT 12960
TAATGATGAA CATATTAATC CAGAGTCTAC ATTAGAATTA ATTCGATATC CTGAAAGTAA 13020
TGAATTTATT TATGATAAAG ACCCACTCAA AGATGTGGAC TTATCAAAAC TTATGGTTAT 13080
TAAAGACCAT TCTTACACAA TTGATATGAA TTATTGGGAT GATACTGACA TCATACATGC 13140
AATTTCAATA TGTACTGCAA TTACAATAGC AGATACTATG TCACAATTAG ATCGAGATAA 13200
TTTAAAAGAG ATAATAGTTA TTGCAAATGA TGATGATATT AATAGCTTAA TCACTGAATT 13260
TTTGACTCTT GACATACTTG TATTTCTCAA GACATTTGGT GGATTATTAG TAAATCAATT 13320
TGCATACACT CTTTATAGTC TAAAAATAGA AGGTAGGGAT CTCATTTGGG ATTATATAAT 13380
GAGAACACTG AGAGATACTT CCCATTCAAT ATTAAAAGTA TTATCTAATG CATTATCTCA 13440
TCCTAAAGTA TTCAAGAGGT TCTGGGATTG TGGAGTTTTA AACCCTATTT ATGGTCCTAA 13500
TACTGCTAGT CAAGACCAGA TAAAACTTGC CCTATCTATA TGTGAATATT CACTAGATCT 13560
ATTTATGAGA GAATGGTTGA ATGGTGTATC ACTTGAAATA TACATTTGTG ACAGCGATAT 13620
GGAAGTTGCA AATGATAGGA AACAAGCCTT TATTTCTAGA CACCTTTCAT TTGTTTGTTG 13680
TTTAGCAGAA ATTGCATCTT TCGGACCTAA CCTGTTAAAC TTAACATACT TGGAGAGACT 13740
TGATCTATTG AAACAATATC TTGAATTAAA TATTAAAGAA GACCCTACTC TTAAATATGT 13800
ACAAATATCT GGATTATTAA TTAAATCGTT CCCATCAACT GTAACATACG TAAGAAAGAC 13860
TGCAATCAAA TATCTAAGGA TTCGCGGTAT TAGTCCACCT GAGGTAATTG ATGATTGGGA 13920
TCCGGTAGAA GATGAAAATA TGCTGGATAA CATTGTCAAA ACTATAAATG ATAACTGTAA 13980
TAAAGATAAT AAAGGGAATA AAATTAACAA TTTCTGGGGA CTAGCACTTA AGAACTATCA 14040
AGTCCTTAAA ATCAGATCTA TAACAAGTGA TTCTGATGAT AATGATAGAC TAGATGCTAA 14100
TACAAGTGGT TTGACACTTC CTCAAGGAGG GAATTATCTA TCGCATCAAT TGAGATTATT 14160
CGGAATCAAC AGCACTAGTT GTCTGAAAGC TCTTGAGTTA TCACAAATTT TAATGAAGGA 14220
AGTCAATAAA GACAAGGACA GGCTCTTCCT GGGAGAAGGA GCAGGAGCTA TGCTAGCATG 14280
TTATGATGCC ACATTAGGAC CTGCAGTTAA TTATTATAAT TCAGGTTTGA ATATAACAGA 14340
TGTAATTGGT CAACGAGAAT TGAAAATATT TCCTTCAGAG GTATCATTAG TAGGTAAAAA 14400
ATTAGGAAAT GTGACACAGA TTCTTAACAG GGTAAAAGTA CTGTTCAATG GGAATCCTAA 14460
TTCAACATGG ATAGGAAATA TGGAATGTGA GAGCTTAATA TGGAGTGAAT TAAATGATAA 14520
GTCCATTGGA TTAGTACATT GTGATATGGA AGGAGCTATC GGTAAATCAG AAGAAACTGT 14580
TCTACATGAA CATTATAGTG TTATAAGAAT TACATACTTG ATTGGGGATG ATGATGTTGT 14640
TTTAGTTTCC AAAATTATAC CTACAATCAC TCCGAATTGG TCTAGAATAC TTTATCTATA 14700
TAAATTATAT TGGAAAGATG TAAGTATAAT ATCACTCAAA ACTTCTAATC CTGCATCAAC 14760





AGAATTATAT CTAATTTCGA AAGATGCATA TTGTACTATA ATGGAACCTA GTGAAATTGT 14820 
TTTATCAAAA CTTAAAAGAT TGTCACTCTT GGAAGAAAAT AATCTATTAA AATGGATCAT 14880 
TTTATCAAAG AAGAGGAATA ATGAATGGTT ACATCATGAA ATCAAAGAAG GAGAAAGAGA 14940 
TTATGGAATC ATGAGACCAT ATCATATGGC ACTACAAATC TTTGGATTTC AAATCAATTT 15000 
AAATCATCTG GCGAAAGAAT TTTTATCAAC CCCAGATCTG ACTAATATCA ACAATATAAT 15060 
CCAAAGTTTT CAGCGAACAA TAAAGGATGT TTTATTTGAA TGGATTAATA TAACTCATGA 15120 
TGATAAGAGA CATAAATTAG GCGGAAGATA TAACATATTC CCACTGAAAA ATAAGGGAAA 15180 
GTTAAGACTG CTATCGAGAA GACTAGTATT AAGTTGGATT TCATTATCAT TATCGACTCG 15240 
ATTACTTACA GGTCGCTTTC CTGATGAAAA ATTTGAACAT AGAGCACAGA CTGGATATGT 15300 
ATCATTAGCT GATACTGATT TAGAATCATT AAAGTTATTG TCGAAAAACA TCATTAAGAA 15360 
TTACAGAGAG TGTATAGGAT CAATATCATA TTGGTTTCTA ACCAAAGAAG TTAAAATACT 15420 
TATGAAATTG ATCGGTGGTG CTAAATTATT AGGAATTCCC AGACAATATA AAGAACCCGA 15480 
AGACCAGTTA TTAGAAAACT ACAATCAACA TGATGAATTT GATATCGATT AAAACATAAA 15540 
TACAATGAAG ATATATCCTA ACCTTTATCT TTAAGCCTAG GAATAGACAA AAAGTAAGAA 15600 
AAACATGTAA TATATATATA CCAAACAGAG TTCTTCTCTT GTTTGGTTAT AGTGAGTCGT 15660 
ATTACAATC 15669 


(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

AATACGACTC ACTATAACCA AACAAGAGAA C 31 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCAAGTACTA TGAGATGCTT CATT 24 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CCCTATAATT TCAACATGTT GAGCCTATTT G 31 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GATTAAAATG TTGGTCGACT TAGTTGCTTC C 31 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

CCATAGAGAG TCCATGGAAA GCGACGCTAA AAACTATC 38

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGGTGTCGTT TCTTTGTCGA CTCATTGGCA ATTGTTG 37 

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

GCAAAGCGTG CCCGGGCCAT GGACACTGAA TCTAACAATG GC 42

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GAAATTCCTT AATCGATTCT CTAGATTC 28

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

CCCATCAACT GTAACATACG TAAGAAAGAC 30

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GGTTAGGATA TGTCGACATT GTATTTATG 29

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

GGGGTTATGC TACTGCAGGC TTTTTTTCTC CCTTAGCCAT CCG 43



(2) INFORMATION FOR SEQ ID NO: 13: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CTCCATTCTA GANTTATAAA AATTATAGAG TTCCC 35 

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 


TAATACGACT CACTATAACC AAACAAGAGA AGAAACTTGT CTGGGAATAT AAATTTAACT 60
TTAAATTAAC TTAGGATTAA AGACATTGAC TAGAAGGTCA AGAAAAGGGA ACTCTATAAT 120
TTCAAAAATG TTGAGCCTAT TTGATACATT TAATGCACGT AGGCAAGAAA ACATAACAAA 180
ATCAGCCGGT GGAGCTATCA TTCCTGGACA GAAAAATACT GTCTCTATAT TCGCCCTTGG 240
ACCGACAATA ACTGATGATA ATGAGAAAAT GACATTAGCT CTTCTATTTC TATCTCATTC 300
ACTAGATAAT GAGAAACAAC ATGCACAAAG GGCAGGGTTC TTGGTGTCTT TATTGTCAAT 360
GGCTTATGCC AATCCAGAGC TCTACCTAAC AACAAATGGA AGTAATGCAG ATGTCAAGTA 420
TGTCATATAC ATGATTGAGA AAGATCTAAA ACGGCAAAAG TATGGAGGAT TTGTGGTTAA 480
GACGAGAGAG ATGATATATG AAAAGACAAC TGATTGGATA TTTGGAAGTG ACCTGGATTA 540
TGATCAGGAA ACTATGTTGC AGAACGGCAG GAACAATTCA ACAATTGAAG ACCTTGTCCA 600
CACATTTGGG TATCCATCAT GTTTAGGAGC TCTTATAATA CAGATCTGGA TAGTTCTGGT 660
CAAAGCTATC ACTAGTATCT CAGGGTTAAG AAAAGGCTTT TTCACCCGAT TGGAAGCTTT 720
CAGACAAGAT GGAACAGTGC AGGCAGGGCT GGTATTGAGC GGTGACACAG TGGATCAGAT 780
TGGGTCAATC ATGCGGTCTC AACAGAGCTT GGTAACTCTT ATGGTTGAAA CATTAATAAC 840
AATGAATACC AGCAGAAATG ACCTCACAAC CATAGAAAAG AATATACAAA TTGTTGGCAA 900
CTACATAAGA GATGCAGGTC TCGCTTCATT CTTCAATACA ATCAGATATG GAATTGAGAC 960
CAGAATGGCA GCTTTGACTC TATCCACTCT CAGACCAGAT ATCAATAGAT TAAAAGCTTT 1020
GATGGAACTG TATTTATCAA AGGGACCACG CGCTCCTTTC ATCTGTATCC TCAGAGATCC 1080
TATACATGGT GAGTTCGCAC CAGGCAACTA TCCTGCCATA TGGAGCTATG CAATGGGGGT 1140
GGCAGTTGTA CAAAATAGAG CCATGCAACA GTATGTGACG GGAAGATCAT ATCTAGACAT 1200
TGATATGTTC CAGCTAGGAC AAGCAGTAGC ACGTGATGCC GAAGCTCAAA TGAGCTCAAC 1260
ACTGGAAGAT GAACTTGGAG TGACACACGA ATCTAAAGAA AGCTTGAAGA GACATATAAG 1320
GAACATAAAC AGTTCAGAGA CATCTTTCCA CAAACCGACA GGTGGATCAG CCATAGAGAT 1380
GGCAATAGAT GAAGAGCCAG AACAATTCGA ACATAGAGCA GATCAAGAAC AAAATGGAGA 1440
ACCTCAATCA TCCATAATTC AATATGCCTG GGCAGAAGGA AATAGAAGCG ATGATCAGAC 1500
TGAGCAAGCT ACAGAATCTG ACAATATCAA GACCGAACAA CAAAACATCA GAGACAGACT 1560
AAACAAGAGA CTCAACGACA AGAAGAAACA AAGCAGTCAA CCACCCACTA ATCCCACAAA 1620
CAGAACAAAC CAGGACGAAA TAGATGATCT GTTTAACGCA TTTGGAAGCA ACTAATCGAA 1680
TCAACATTTT AATCTAAATC AATAATAAAT AAGAAAAACT TAGGATTAAA GAATCCTATC 1740
ATACCGGAAT ATAGGGTGGT AAATTTAGAG TCTGCTTGAA ACTCAATCAA TAGAGAGTTG 1800
ATGGAAAGCG ATGCTAAAAA CTATCAAATC ATGGATTCTT GGGAAGAGGA ATCAAGAGAT 1860
AAATCAACTA ATATCTCCTC GGCCCTCAAC ATCATTGAAT TCATACTCAG CACCGACCCC 1920
CAAGAAGACT TATCGGAAAA CGACACAATC AACACAAGAA CCCAGCAACT CAGTGCCACC 1980
ATCTGTCAAC CAGAAATCAA ACCAACAGAA ACAAGTGAGA AAGATAGTGG ATCAACTGAC 2040
AAAAATAGAC AGTCTGGGTC ATCACACGAA TGTACAACAG AAGCAAAAGA TAGAAATATT 2100
GATCAGGAAA CTGTACAGAG AGGACCTGGG AGAAGAAGCA GCTCAGATAG TAGAGCTGAG 2160
ACTGTGGTCT CTGGAGGAAT CCCCAGAAGC ATCACAGATT CTAAAAATGG AACCCAAAAC 2220
ACGGAGGATA TTGATCTCAA TGAAATTAGA AAGATGGATA AGGACTCTAT TGAGGGGAAA 2280
ATGCGACAAT CTGCAAATGT TCCAAGCGAG ATATCAGGAA GTGATGACAT ATTTACAACA 2340
GAACAAAGTA GAAACAGTGA TCATGGAAGA AGCCTGGAAT CTATCAGTAC ACCTGATACA 2400
AGATCAATAA GTGTTGTTAC TGCTGCAACA CCAGATGATG AAGAAGAAAT ACTAATGAAA 2460
AATAGTAGGA CAAAGAAAAG TTCTTCAACA CATCAAGAAG ATGACAAAAG AATTAAAAAA 2520
GGGGGAAAAG GGAAAGACTG GTTTAAGAAA TCAAAAGATA CCGACAACCA GATACCAACA 2580
TCAGACTACA GATCCACATC AAAAGGGCAG AAGAAAATCT CAAAGACAAC AACCACCAAC 2640
ACCGACACAA AGGGGCAAAC AGAAATACAG ACAGAATCAT CAGAAACACA ATCCTCATCA 2700
TGGAATCTCA TCATCGACAA CAACACCGAC CGGAACGAAC AGACAAGCAC AACTCCTCCA 2760
ACAACAACTT CCAGATCAAC TTATACAAAA GAATCGATCC GAACAAACTC TGAATCCAAA 2820
CCCAAGACAC AAAAGACAAA TGGAAAGGAA AGGAAGGATA CAGAAGAGAG CAATCGATTT 2880
ACAGAGAGGG CAATTACTCT ATTGCAGAAT CTTGGTGTAA TTCAATCCAC ATCAAAACTA 2940
GATTTATATC AAGACAAACG AGTTGTATGT GTAGCAAATG TACTAAACAA TGTAGATACT 3000
GCATCAAAGA TAGATTTCCT GGCAGGATTA GTCATAGGGG TTTCAATGGA CAACGACACA 3060
AAATTAACAC AGATACAAAA TGAAATGCTA AACCTCAAAG CAGATCTAAA GAAAATGGAC 3120
GAATCACATA GAAGATTGAT AGAAAATCAA AGAGAACAAC TGTCATTGAT CACGTCACTA 3180
ATTTCAAATC TCAAAATTAT GACTGAGAGA GGAGGAAAGA AAGACCAAAA TGAATCCAAT 3240
GAGAGAGTAT CCATGATCAA AACAAAATTG AAAGAAGAAA AGATCAAGAA GACCAGGTTT 3300
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GAC CCACTTA 

TGGAGGCACA 

AGGCATTGAC 

AAGAATATAC 

CCGATCTATA 

TCGACATGCA 

3360 

GGAGATACAC 

TAGAGAACGA 

TGTACAAGTT 

AAATCAGAGA 

TATTAAGTTC 

ATACAATGAG 

3420 

TCAAATGCAA 

CAAGACTAAT 

ACCCAAAAAA 

GTGAGCAGTA 

CAATGAGATC 

ACTAGTTGCA 

3480 

GTCATCAACA 

ACAGCAATCT 

CTCACAAAGC 

ACAAAACAAT 

CATACATAAA 

CGAACTCAAA 

3540 

CGTTGCAAAA 

ATGATGAAGA 

AGTATCTGAA 

TTAATGGACA 

TGTTCAATGA 

AGATGTCAAC 

3600 

AATTGC CAAT 

GATCCAACAA 

AGAAACGACA 

CCGAACAAAC 

AGACAAGAAA 

CAACAGTAGA 

3660 

TCAAAACCTG 

TCAACACACA 

CAAAATCAAG 

CAGAATGAAA 

CAACAGATAT 

CAATCAATAT 

3720 

ACAAATAAGA 

AAAACTTAGG 

ATTAAAGAAT 

AAATTAATCC 

TTGTCCAAAA 

TGAGTATAAC 

3780 

TAACTCTGCA 

ATATACACAT 

TCCCAGAATC 

ATCATTCTCT 

GAAAATGGTC 

AT AT AG AAC C 

3840 

ATTACCACTC 

AAAGTCAATG 

AACAGAGGAA 

AGCAGTACCC 

CACATTAGAG 

TTGC CAAGAT 

3900 

CGGAAATCCA 

CCAAAACACG 

GATC CCGGTA 

TTT AGATGT C 

TTCTTACTCG 

GCTTCTTCGA 

3960 

GATGGAACGA 

ATCAAAGACA 

AATACGGGAG 

TGTGAATGAT 

CTCGACAGTG 

ACCCGAGTTA 

4020 

CAAAGTTTGT 

GGCTCTGGAT 

CATTACCAAT 

CGGATTGGCT 

AAGTACACTG 

GGAATGAC C A 

4080 

GGAATTGTTA 

CAAGCCGCAA 

CCAAACTGGA 

TATAGAAGTG 

AGAAGAACAG 

TCAAAGCGAA 

4140 

AGAGATGGTT 

GTTTACACGG 

TACAAAATAT 

AAAACCAGAA 

CTGTACCCAT 

GGTCCAATAG 

4200 

ACTAAGAAAA 

GGAATGCTGT 

TCGATGCCAA 

CAAAGTTGC T 

CTTGCTCCTC 

AATGTCTTCC 

4260 

ACTAGATAGG 

AGCATAAAAT 

TTAGAGTAAT 

CTTCGTGAAT 

TGTACGGCAA 

TTGGATCAAT 

4320 

AACCTTGTTC 

AAAATTC CTA 

AGTCAATGGC 

ATCACTATCT 

CTACCCAACA 

CAATATCAAT 

4380 

CAATCTGCAG 

GTACACATAA 

AAACAGGGGT 

TCAGACTGAT 

TCTAAAGGGA 

TAGTTCAAAT 

4440 

TTTGGATGAG 

AAAGGCGAAA 

AATCACTGAA 

TTTCATGGTC 

CATCTCGGAT 

TGATCAAAAG 

4500 

AAAAGTAGGC 

AGAATGTACT 

CTGTTGAATA 

CTGTAAACAG 

AAAATCGAGA 

AAATGAGATT 

4560 

GATATTTTCT 

TT AGG AC TAG 

TTGGAGGAAT 

CAGTCTTCAT 

GTCAATGCAA 

CTGGGTCCAT 

4620 

ATCAAAAACA 

CTAGCAAGTC 

AGCTGGTATT 

CAAAAGAGAG 

ATTTGTTATC 

CTTTAATGGA 

4680 

TCTAAATCCG 

CATCTCAATC 

TAGTTATCTG 

GGCTTCATCA 

GTAGAGATTA 

CAAGAGTGGA 

4740 

TGCAATTTTC 

CAACCTTCTT 

TACCTGGCGA 

GTTCAGATAC 

TATCCTAATA 

TTATTGCAAA 

4800 

AGGAGTTGGG 

AAAATCAAAC 

AATGGAACTA 

GTAATCTCTA 

TTTTAGTCCG

GACGTATCTA 

4860 

TTAAGCCGAA 

GCAAATAAAG 

GATAATCAAA 

AACTTAGGAC 

AAAAGAGGTC 

AATACCAACA 

4920 

ACTATTAGCA 

GTCACACTCG 

CAAGAATAAG 

AGAGAAGGGA 

CCAAAAAAGT 

CAAATAGGAG 

4980 

AAATCAAAAC 

AAAAGGTACA 

GAACACCAGA 

ACAACAAAAT 

CAAAACATCC 

AACTCACTCA 

5040 

AAACAAAAAT 

TCCAAAAGAG 

ACCGGCAACA 

CAACAAGCAC 

TGAACACAAT 

GCCAACTTCA 

5100 

ATACTGCTAA 

TTATTACAAC 

CATGATCATG 

GCATCTTTCT 

GCCAAATAGA 

TATCACAAAA 

5160 

CTACAGCACG 

TAGGTGTATT 

GGTCAACAGT 

CCCAAAGGGA 

TGAAGATATC 

ACAAAACTTT 

5220 

GAAACAAGAT 

ATCTAATTTT 

GAGCCTCATA 

CCAAAAATAG 

AAGACTCTAA 

CTCTTGTGGT 

5280 

GACCAACAGA 

TCAAGCAATA 

CAAGAAGTTA 

TTGGATAGAC 

TGATCATCCC 

TTTATATGAT 

5340 

GGATTAAGAT 

TACAGAAAGA 

TGTGATAGTA 

ACCAATCAAG 

AATCCAATGA 

AAACACTGAT 

5400 
CCCAGAACAA

AACGATTCTT 

TGGAGGGGTA 

ATTGGAACCA 

TTGCTCTGGG 

AGTAGCAACC 

5460 

TCAGCACAAA

TTACAGCGGC 

AGTTGCTCTG 

GTTGAAGCCA 

AGCAGGCAAG 

ATCAGACATC 

5520 

GAAAAACTCA 

AAGAAGCAAT 

TAGGGACACA 

AACAAAGCAG 

TGCAGTCAGT 

TCAGAGCTCC 

5580 

ATAGGAAATT 

TAATAGTAGC 

AATTAAATCA 

GTCCAGGATT 

ATGTTAACAA 

AGAAATCGTG 

5640 

CCATCGATTG 

CGAGGCTAGG 

TTGTGAAGCA 

GCAGGACTTC 

AATTAGGAAT 

TGCATTAACA 

5700 

CAGCATTACT 

CAGAATTAAC 

AAACATATTT 

GGTGATAACA 

TAGGATCGTT 

ACAAGAAAAA 

5760 

GGAATAAAAT 

TACAAGGTAT 

AGCATCATTA 

TACCGCACAA 

ATATCACAGA 

AATATTCACA 

5820 

ACATCAACAG 

TTGATAAATA 

TGATATCTAT 

GATCTGTTAT 

TTACAGAATC 

AATAAAGGTG 

5880 

AGAGTTATAG 

ATGTTGACTT 

GAATGATTAC 

TCAATCACCC 

TCCAAGTCAG 

ACTCCCTTTA 

5940 

TTAACTAGGC 

TGCTGAACAC 

TCAGATCTAC 

AAAGTAGATT 

CCATATCATA 

TAACATCCAA 

6000 

AACAGAGAAT 

GGTATATCCC 

TCTTCCCAGC 

CATATCATGA

CGAAAGGGGC 

ATTTCTAGGT 

6060 

GGAGCAGACG 

TCAAAGAATG 

TATAGAAGCA 

TTCAGCAGCT 

ATATATGCCC 

TTCTGATCCA 

6120 

GGATTTGTAT 

TAAACCATGA 

AATAGAGAGC 

TGCTTATCAG 

GAAACATATC 

CCAATGTCCA

6180 

AGAACAACGG 

TCACATCAGA 

CATTGTTCCA 

AGATATGCAT 

TTGTCAATGG 

AGGAGTGGTT 

6240 

GCAAACTGTA 

TAACAACCAC 

CTGTACATGC 

AACGGAATTG 

GTAATAGAAT 

CAATCAACCA 

6300 

CCTGATCAAG 

GAGTAAAAAT 

TATAACACAT 

AAAGAATGTA 

GTACAATAGG 

TATCAACGGA 

6360 

ATGCTGTTCA 

ATACAAATAA 

AGAAGGAACT 

CTTGCATTCT 

ATACACCAAA

TGATATAACA 

6420 

CTAAACAATT 

CTGTTGCACT 

TGATCCAATT 

GACATATCAA 

TCGAGCTCAA 

CAAGGCCAAA 

6480 

TCAGATCTAG 

AAGAATCAAA 

AGAATGGATA 

AGAAGGTCAA 

ATCAAAAACT 

AGATTCTATT 

6540 

GGAAATTGGC 

ATCAATCTAG 

CACTACAATC 

ATAATTATTT 

TGATAATGAT 

CATTATATTG 

6600 

TTTATAATTA 

ATATAACGAT 

AATTACAATT 

GCAATTAAGT 

ATTACAGAAT 

TCAAAAGAGA 

6660 

AATCGAGTGG 

ATCAAAATGA 

CAAGCCATAT

GTACTAACAA 

ACAAATAACA 

TATCTACAGA 

6720 

TCATTAGATA 

TTAAAATTAT 

AAAAAACTTA 

GGAGTAAAGT 

TACGCAATCC 

AACTCTACTC 

6780 

ATATAATTGA 

GGAAGGACCC 

AATAGACAAA 

TCCAAATTCG 

AGATGGAATA 

CTGGAAGCAT 

6840 

ACCAATCACG 

GAAAGGATGC 

TGGTAATGAG 

CTGGAGACGT 

CTATGGCTAC 

TCATGGCAAC 

6900 

AAGCTCACTA 

ATAAGATAAT 

ATACATATTA 

TGGACAATAA 

TCCTGGTGTT 

ATTATCAATA 

6960 

GTCTTCATCA 

TAGTGCTAAT 

TAATTCCATC 

AAAAGTGAAA 

AGGCCCACGA 

ATCATTGCTG 

7020 

CAAGACATAA 

ATAATGAGTT 

TATGGAAATT 

ACAGAAAAGA 

TCCAAATGGC 

ATCGGATAAT 

7080 

ACCAATGATC 

TAATACAGTC 

AGGAGTGAAT 

ACAAGGCTTC 

TTACAATTCA 

GAGTCATGTC 

7140 

CAGAATTACA 

TACCAATATC 

ATTGACACAA 

CAGATGTCAG 

ATCTTAGGAA 

ATTCATTAGT 

7200 

GAAATTACAA 

TTAGAAATGA 

TAATCAAGAA 

GTGCTGCCAC 

AAAGAATAAC 

ACATGATGTA 

7260 

GGTATAAAAC 

CTTTAAATCC 

AGATGATTTT 

TGGAGATGCA 

CGTCTGGTCT 

TCCATCTTTA 

7320 

ATGAAAACTC 

CAAAAATAAG 

GTTAATGCCA 

GGGCCGGGAT 

TATTAGCTAT

GCCAACGACT 

7380 

GTTGATGGCT 

GTGTTAGAAC 

TCCGTCTTTA 

GTTATAAATG 

ATCTGATTTA 

TGCTTATACC 

7440 

TCAAATCTAA 

TTACTCGAGG

TTGTCAGGAT 

ATAGGAAAAT 

CATATCAAGT 

CTTACAGATA 

7500 
GGGATAATAA

CTGTAAACTC 

AGACTTGGTA 

CCTGACTTAA 

ATCCTAGGAT 

CTCTCATACC 

7560 

TTTAACATAA 

ATGACAATAG 

GAAGTCATGT 

TCTCTAGCAC 

TCCTAAATAT 

AGATGTATAT 

7620 

CAACTGTGTT 

CAACTCCCAA 

AGTTGATGAA 

AGATCAGATT 

ATGCATCATC 

AGGCATAGAA 

7680 

GATATTGTAC 

TTGATATTGT 

CAATTATGAT 

GGTTCAATCT 

CAACAACAAG 

ATTTAAGAAT 

7740 

AATAACATAA 

GCTTTGATCA 

ACCATATGCT 

GCACTATACC 

CATCTGTTGG 

ACCAGGGATA 

7800 

TACTACAAAG 

GCAAAATAAT 

ATTTCTCGGG 

TATGGAGGTC 

TTGAACATCC

AATAAATGAG 

7860 

AATGTAATCT 

GCAACACAAC 

TGGGTGCCCC 

GGGAAAACAC 

AGAGAGACTG 

TAATCAAGCA 

7920 

TCTCATAGTA 

CTTGGTTTTC 

AGATAGGAGG 

ATGGTCAACT 

CCATCATTGT 

TGTTGACAAA 

7980 

GGCTTAAACT 

CAATTCCAAA 

ATTGAAAGTA 

TGGACGATAT 

CTATGCGACA 

AAATTACTGG 

8040 

GGGTCAGAAG 

GAAGGTTACT 

TCTACTAGGT 

AACAAGATCT 

ATATATATAC 

AAGATCTACA 

8100 

AGTTGGCATA 

GCAAGTTACA 

ATTAGGAATA 

ATTGATATTA 

CTGATTACAG 

TGATATAAGG 

8160 

ATAAAATGGA 

CATGGCATAA 

TGTGCTATCA 

AGACCAGGAA 

ACAATGAATG 

TCCATGGGGA 

8220 

CATTCATGTC 

CAGATGGATG 

TATAACAGGA 

GTATATACTG 

ATGCATATCC 

ACTCAATCCC 

8280 

ACAGGGAGCA 

TTGTGTCATC 

TGTCATATTA 

GACTCACAAA 

AATCGAGAGT 

GAACCCAGTC 

8340 

ATAACTTACT 

CAACAGCAAC 

CGAAAGAGTA 

AACGAGCTGG 

CCATCCTAAA 

CAGAACACTC 

8400 

TCAGCTGGAT 

ATACAACAAC 

AAGCTGCATT 

ACACACTATA 

ACAAAGGATA 

TTGTTTTCAT 

8460 

ATAGTAGAAA 

TAAATCATAA 

AAGCTTAAAC 

ACATTTCAAC 

CCATGTTGTT 

CAAAACAGAG 

8520 

ATTCCAAAAA 

GCTGCAGTTA 

ATCATAATTA 

ACCATAATAT 

GCATCAATCT 

ATCTATAATA 

8580 

CAAGTATATG 

ATAAGTAATC 

AGCAATCAGA 

CAATAGACAA 

AAGGGAAATA 

TAAAAAACTT 

8640 

AGGAGCAAAG 

CGTGCTCGGG 

AAATGGACAC 

TGAATCTAAC 

AATGGCACTG 

TATCTGACAT 

8700 

ACTCTATCCT 

GAGTGTCACC 

TTAACTCTCC 

TATCGTTAAA 

GGTAAAATAG 

CACAATTACA 

8760 

CACTATTATG 

AGTCTACCTC 

AGCCTTATGA

TATGGATGAC 

GACTCAATAC 

TAGTTATCAC 

8820 

TAGACAGAAA 

ATAAAACTTA 

ATAAATTGGA 

TAAAAGACAA 

CGATCTATTA 

GAAGATTAAA 

8880 

ATTAATATTA 

ACTGAAAAAG 

TGAATGACTT 

AGGAAAATAC 

ACATTTATCA 

GATATCCAGA 

8940 

AATGTCAAAA 

GAAATGTTCA 

AATTATATAT 

ACCTGGTATT 

AACAGTAAAG 

TGACTGAATT 

9000 

ATTACTTAAA 

GCAGATAGAA 

CATATAGTCA 

AATGACTGAT 

GGATTAAGAG 

ATCTATGGAT 

9060 

TAATGTGCTA 

TCAAAATTAG 

CCTCAAAAAA 

TGATGGAAGC

AATTATGATC 

TTAATGAAGA 

9120 

AATTAATAAT 

ATATCGAAAG 

TTCACACAAC 

CTATAAATCA 

GATAAATGGT 

ATAATCCATT 

9180 

CAAAACATGG 

TTTACTATCA 

AGTATGATAT 

GAGAAGATTA 

CAAAAAGCTC 

GAAATGAGAT 

9240 

CACTTTTAAT 

GTTGGGAAGG 

ATTATAACTT 

GTTAGAAGAC 

CAGAAGAATT 

TCTTATTGAT 

9300 

ACATCCAGAA 

TTGGTTTTGA 

TATTAGATAA 

ACAAAACTAT 

AATGGTTATC 

TAATTACTCC 

9360 

TGAATTAGTA 

TTGATGTATT 

GTGACGTAGT 

CGAAGGCCGA 

TGGAATATAA 

GTGCATGTGC 

9420 

TAAGTTAGAT 

CCAAAATTAC 

AATCTATGTA 

TCAGAAAGGT 

AATAACCTGT 

GGGAAGTGAT 

9480 

AGATAAATTG 

TTTCCAATTA 

TGGGAGAAAA 

GACATTTGAT 

GTGATATCGT 

TATTAGAACC 

9540 

ACTTGCATTA 

TCCTTAATTC 

AAACTCATGA 

TCCTGTTAAA 

CAACTAAGAG 

GAGCTTTTTT 

9600 
AAATCATGTG TTATCCGAGA TGGAATTAAT ATTTGAATCT AGAGAATCGA TTAAGGAATT 9660
TCTGAGTGTA GATTACATTG ATAAAATTTT AGATATATTT AATAAGTCTA CAATAGATGA 9720
AATAGCAGAG ATTTTCTCTT TTTTTAGAAC ATTTGGGCAT CCTCCATTAG AAGCTAGTAT 9780
TGCAGCAGAA AAGGTTAGAA AATATATGTA TATTGGAAAA CAATTAAAAT TTGACACTAT 9840
TAATAAATGT CATGCTATCT TCTGTACAAT AATAATTAAC GGATATAGAG AGAGGCATGG 9900
TGGACAGTGG CCTCCTGTGA CATTACCTGA TCATGCACAC GAATTCATCA TAAATGCTTA 9960
CGGTTCAAAC TCTGCGATAT CATATGAAAA TGCTGTTGAT TATTACCAGA GCTTTATAGG 10020
AATAAAATTC AATAAATTCA TAGAGCCTCA GTTAGATGAG GATTTGACAA TTTATATGAA 10080
AGATAAAGCA TTATCTCCAA AAAAATCAAA TTGGGACACA GTTTATCCTG CATCTAATTT 10140
ACTGTACCGT ACTAACGCAT CCAACGAATC ACGAAGATTA GTTGAAGTAT TTATAGCAGA 10200
TAGTAAATTT GATCCTCATC AGATATTGGA TTATGTAGAA TCTGGGGACT GGTTAGATGA 10260
TCCAGAATTT AATATTTCTT ATAGTCTTAA AGAAAAAGAG ATCAAACAGG AAGGTAGACT 10320
CTTTGCAAAA ATGACATACA AAATGAGAGC TACACAAGTT TTATCAGAGA CCCTACTTGC 10380
AAATAACATA GGAAAATTCT TTCAAGAAAA TGGGATGGTG AAGGGAGAGA TTGAATTACT 10440
TAAGAGATTA ACAACCATAT CAATATCAGG AGTTCCACGG TATAATGAAG TGTACAATAA 10500
TTCTAAAAGC CATACAGATG ACCTTAAAAC CTACAATAAA ATAAGTAATC TTAATTTGTC 10560
TTCTAATCAG AAATCAAAGA AATTTGAATT CAAGTCAACG GATATCTACA ATGATGGATA 10620
CGAGACTGTG AGCTGTTTCC TAACAACAGA TCTCAAAAAA TACTGTCTTA ATTGGAGATA 10680
TGAATCAACA GCTCTATTTG GAGAAACTTG CAACCAAATA TTTGGATTAA ATAAATTGTT 10740
TAATTGGTTA CACCCTCGTC TTGAAGGAAG TACAATCTAT GTAGGTGATC CTTACTGTCC 10800
TCCATCAGAT AAAGAACATA TATCATTAGA GGATCACCCT GATTCTGGTT TTTACGTTCA 10860
TAACCCAAGA GGGGGTATAG AAGGATTTTG TCAAAAATTA TGGACACTCA TATCTATAAG 10920
TGCAATACAT CTAGCAGCTG TTAGAATAGG CGTGAGGGTG ACTGCAATGG TTCAAGGAGA 10980
CAATCAAGCT ATAGCTGTAA CCACAAGAGT ACCCAACAAT TATGACTACA GAGTTAAGAA 11040
GGAGATAGTT TATAAAGATG TAGTGAGATT TTTTGATTCA TTAAGAGAAG TGATGGATGA 11100
TCTAGGTCAT GAACTTAAAT TAAATGAAAC GATTATAAGT AGCAAGATGT TCATATATAG 11160
CAAAAGAATC TATTATGATG GGAGAATTCT TCCTCAAGCT CTAAAAGCAT TATCTAGATG 11220
TGTCTTCTGG TCAGAGACAG TAATAGACGA AACAAGATCA GCATCTTCAA ATTTGGCAAC 11280
ATCATTTGCA AAAGCAATTG AGAATGGTTA TTCACCTGTT CTAGGATATG CATGCTCAAT 11340
TTTTAAGAAT ATTCAACAAC TATATATTGC CCTTGGGATG AATATCAATC CAACTATAAC 11400
ACAGAATATC AGAGATCAGT ATTTTAGGAA TCCAAATTGG ATGCAATATG CCTCTTTAAT 11460
ACCTGCTAGT GTTGGGGGAT TCAATTACAT GGCCATGTCA AGATGTTTTG TAAGGAATAT 11520
TGGTGATCCA TCAGTTGCCG CATTGGCTGA TATTAAAAGA TTTATTAAGG CGAATCTATT 11580
AGACCGAAGT GTTCTTTATA GGATTATGAA TCAAGAACCA GGTGAGTCAT CTTTTTTGGA 11640
CTGGGCTTCA GATCCATATT CATGCAATTT ACCACAATCT CAAAATATAA CCACCATGAT 11700
AAAAAATATA ACAGCAAGGA ATGTATTACA AGATTCACCA AATCCATTAT TATCTGGATT 11760
ATTCACAAAT ACAATGATAG AAGAAGATGA AGAATTAGCT GAGTTCCTGA TGGACAGGAA 11820
GGTAATTCTC CCTAGAGTTG CACATGATAT TCTAGATAAT TCTCTCACAG GAATTAGAAA 11880
TGCCATAGCT GGAATGTTAG ATACGACAAA ATCACTAATT CGGGTTGGCA TAAATAGAGG 11940
AGGACTGACA TATAGTTTGT TGAGGAAAAT CAGTAATTAC GATCTAGTAC AATATGAAAC 12000
ACTAAGTAGG ACTTTGCGAC TAATTGTAAG TGATAAAATC AAGTATGAAG ATATGTGTTC 12060
GGTAGACCTT GCCATAGCAT TGCGACAAAA GATGTGGATT CATTTATCAG GAGGAAGGAT 12120
GATAAGTGGA CTTGAAACGC CTGACCCATT AGAATTACTA TCTGGGGTAG TAATAACAGG 12180
ATCAGAACAT TGTAAAATAT GTTATTCTTC AGATGGCACA AACCCATATA CTTGGATGTA 12240
TTTACCCGGT AATATCAAAA TAGGATCAGC AGAAACAGGT ATATCGTCAT TAAGAGTTCC 12300
TTATTTTGGA TCAGTCACTG ATGAAAGATC TGAAGCACAA TTAGGATATA TCAAGAATCT 12360
TAGTAAACCT GCAAAAGCCG CAATAAGAAT AGCAATGATA TATACATGGG CATTTGGTAA 12420
TGATGAGATA TCTTGGATGG AAGCCTCACA GATAGCACAA ACACGTGCAA ATTTTACACT 12480
AGATAGTCTC AAAATTTTAA CACCGGTAGC TACATCAACA AATTTATCAC ACAGATTAAA 12540
GGATACTGCA ACTCAGATGA AATTCTCCAG TACATCATTG ATCAGAGTCA GCAGATTCAT 12600
AACAATGTCC AATGATAACA TGTCTATCAA AGAAGCTAAT GAAACCAAAG ATACTAATCT 12660
TATTTATCAA CAAATAATGT TAACAGGATT AAGTGTTTTC GAATATTTAT TTAGATTAAA 12720
AGAAACCACA GGACACAACC CTATAGTTAT GCATCTGCAC ATAGAAGATG AGTGTTGTAT 12780
TAAAGAAAGT TTTAATGATG AACATATTAA TCCAGAGTCT ACATTAGAAT TAATTCGATA 12840
TCCTGAAAGT AATGAATTTA TTTATGATAA AGACCCACTC AAAGATGTGG ACTTATCAAA 12900
ACTTATGGTT ATTAAAGACC ATTCTTACAC AATTGATATG AATTATTGGG ATGATACTGA 12960
CATCATACAT GCAATTTCAA TATGTACTGC AATTACAATA GCAGATACTA TGTCACAATT 13020
AGATCGAGAT AATTTAAAAG AGATAATAGT TATTGCAAAT GATGATGATA TTAATAGCTT 13080
AATCACTGAA TTTTTGACTC TTGACATACT TGTATTTCTC AAGACATTTG GTGGATTATT 13140
AGTAAATCAA TTTGCATACA CTCTTTATAG TCTAAAAATA GAAGGTAGGG ATCTCATTTG 13200
GGATTATATA ATGAGAACAC TGAGAGATAC TTCCCATTCA ATATTAAAAG TATTATCTAA 13260
TGCATTATCT CATCCTAAAG TATTCAAGAG GTTCTGGGAT TGTGGAGTTT TAAACCCTAT 13320
TTATGGTCCT AATACTGCTA GTCAAGACCA GATAAAACTT GCCCTATCTA TATGTGAATA 13380
TTCACTAGAT CTATTTATGA GAGAATGGTT GAATGGTGTA TCACTTGAAA TATACATTTG 13440
TGACAGCGAT ATGGAAGTTG CAAATGATAG GAAACAAGCC TTTATTTCTA GACACCTTTC 13500
ATTTGTTTGT TGTTTAGCAG AAATTGCATC TTTCGGACCT AACCTGTTAA ACTTAACATA 13560
CTTGGAGAGA CTTGATCTAT TGAAACAATA TCTTGAATTA AATATTAAAG AAGACCCTAC 13620
TCTTAAATAT GTACAAATAT CTGGATTATT AATTAAATCG TTCCCATCAA CTGTAACATA 13680
CGTAAGAAAG ACTGCAATCA AATATCTAAG GATTCGCGGT ATTAGTCCAC CTGAGGTAAT 13740
TGATGATTGG GATCCGGTAG AAGATGAAAA TATGCTGGAT AACATTGTCA AAACTATAAA 13800
TGATAACTGT

AATAAAGATA 

ATAAAGGGAA 

TAAAATTAAC 

AATTTCTGGG 

GACTAGCACT 

13860 

TAAGAACTAT 

CAAGTCCTTA 

AAATCAGATC 

TATAACAAGT 

GATTCTGATG 

ATAATGATAG 

13920 

ACTAGATGCT 

AATACAAGTG 

GTTTGACACT 

TCCTCAAGGA 

GGGAATTATC 

TATCGCATCA 

13980 

ATTGAGATTA 

TTCGGAATCA 

ACAGCACTAG 

TTGTCTGAAA 

GCTCTTGAGT 

TATCACAAAT 

14040 

TTTAATGAAG 

GAAGTCAATA 

AAGACAAGGA 

CAGGCTCTTC 

CTGGGAGAAG 

GAGCAGGAGC 

14100 

TATGCTAGCA 

TGTTATGATG 

CCACATTAGG

ACCTGCAGTT 

AATTATTATA 

ATTCAGGTTT 

14160 

GAATATAACA 

GATGTAATTG 

GTCAACGAGA 

ATTGAAAATA 

TTTCCTTCAG 

AGGTATCATT 

14220 

AGTAGGTAAA 

AAATTAGGAA 

ATGTGACACA 

GATTCTTAAC 

AGGGTAAAAG 

TACTGTTCAA 

14280 

TGGGAATCCT 

AATTCAACAT 

GGATAGGAAA 

TATGGAATGT 

GAGAGCTTAA 

TATGGAGTGA 

14340 

ATTAAATGAT 

AAGTCCATTG 

GATTAGTACA 

TTGTGATATG 

GAAGGAGCTA 

TCGGTAAATC 

14400 

AGAAGAAACT 

GTTCTACATG 

AACATTATAG 

TGTTATAAGA 

ATTACATACT 

TGATTGGGGA 

14460 

TGATGATGTT 

GTTTTAGTTT 

CCAAAATTAT

ACCTACAATC 

ACTCCGAATT 

GGTCTAGAAT 

14520 

ACTTTATCTA 

TATAAATTAT 

ATTGGAAAGA 

TGTAAGTATA 

ATATCACTCA 

AAACTTCTAA 

14580 

TCCTGCATCA 

ACAGAATTAT 

ATCTAATTTC 

GAAAGATGCA 

TATTGTACTA 

TAATGGAACC 

14640 

TAGTGAAATT 

GTTTTATCAA 

AACTTAAAAG 

ATTGTCACTC 

TTGGAAGAAA 

ATAATCTATT 

14700 

AAAATGGATC 

ATTTTATCAA 

AGAAGAGGAA 

TAATGAATGG 

TTACATCATG 

AAATCAAAGA 

14760 

AGGAGAAAGA 

GATTATGGAA

TCATGAGACC 

ATATCATATG 

GCACTACAAA 

TCTTTGGATT 

14820 

TCAAATCAAT 

TTAAATCATC 

TGGCGAAAGA

ATTTTTATCA 

ACCCCAGATC 

TGACTAATAT 

14880 

CAACAATATA 

ATCCAAAGTT

TTCAGCGAAC 

AATAAAGGAT 

GTTTTATTTG 

AATGGATTAA 

14940 

TATAACTCAT 

GATGATAAGA 

GACATAAATT 

AGGCGGAAGA 

TATAACATAT 

TCCCACTGAA 

15000 

AAATAAGGGA 

AAGTTAAGAC 

TGCTATCGAG 

AAGACTAGTA 

TTAAGTTGGA 

TTTCATTATC 

15060 

ATTATCGACT 

CGATTACTTA 

CAGGTCGCTT 

TCCTGATGAA 

AAATTTGAAC 

ATAGAGCACA 

15120 

GACTGGATAT 

GTATCATTAG

CTGATACTGA 

TTTAGAATCA 

TTAAAGTTAT 

TGTCGAAAAA 

15180 

CATCATTAAG 

AATTACAGAG 

AGTGTATAGG 

ATCAATATCA 

TATTGGTTTC 

TAACCAAAGA 

15240 

AGTTAAAATA 

CTTATGAAAT 

TGATCGGTGG 

TGCTAAATTA 

TTAGGAATTC 

CCAGACAATA 

15300 

TAAAGAACCC 

GAAGACCAGT 

TATTAGAAAA 

CTACAATCAA 

CATGATGAAT 

TTGATATCGA 

15360 

TTAAAACATA 

AATACAATGA 

AGATATATCC

TAACCTTTAT

CTTTAAGCCT 

AGGAATAGAC 

15420 

AAAAAGTAAG 

AAAAACATGT 

AATATATATA 

TACCAAACAG 

AGTTCTTCTC 

TTGTTTGGTG 

15480 

GGTCGGCATG 

GCATCTCCAC 

CTCCTCGCGG 

TCCGGACCTG 

GGCATCCGAA 

GGAGGACGCA 

15540 

CGTCCACTCG 

GATGGCTAAG 

GGAGAGCCTG 

CAGTAGCATA 

ACCCCTTGGG 

GCCTCTAAAC 

15600 

GGGTCTTGAG 

GGGTTTTTTG 

CTGAAAGGAG 

GAACTATATA 

CGCGTCGACG 

GGCCCCGCGC 

15660 


(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15666 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 


TAATACGACT 

CACTATAGGA 

CCAAACAAGA 

GAAGAAACTT 

GTCTGGGAAT 

ATAAATTTAA 

60 

CTTTAAATTA 

ACTTAGGATT

AAAGACATTG 

ACTAGAAGGT 

CAAGAAAAGG 

GAACTCTATA 

120 

ATTTCAAAAA 

TGTTGAGCCT 

ATTTGATACA 

TTTAATGCAC 

GTAGGCAAGA 

AAACATAACA 

180 

AAATCAGCCG 

GTGGAGCTAT 

CATTCCTGGA 

CAGAAAAATA 

CTGTCTCTAT 

ATTCGCCCTT

240 

GGACCGACAA 

TAACTGATGA 

TAATGAGAAA 

ATGACATTAG 

CTCTTCTATT 

TCTATCTCAT 

300 

TCACTAGATA 

ATGAGAAACA 

ACATGCACAA 

AGGGCAGGGT 

TCTTGGTGTC 

TTTATTGTCA 

360 

ATGGCTTATG 

CCAATCCAGA 

GCTCTACCTA 

ACAACAAATG 

GAAGTAATGC 

AGATGTCAAG 

420 

TATGTCATAT 

ACATGATTGA 

GAAAGATCTA 

AAACGGCAAA 

AGTATGGAGG 

ATTTGTGGTT 

480 

AAGACGAGAG 

AGATGATATA 

TGAAAAGACA 

ACTGATTGGA 

TATTTGGAAG 

TGACCTGGAT 

540 

TATGATCAGG 

AAACTATGTT 

GCAGAACGGC 

AGGAACAATT 

CAACAATTGA 

AGACCTTGTC 

600 

CACACATTTG 

GGTATCCATC 

ATGTTTAGGA 

GCTCTTATAA 

TACAGATCTG 

GATAGTTCTG 

660 

GTCAAAGCTA 

TCACTAGTAT 

CTCAGGGTTA 

AGAAAAGGCT 

TTTTCACCCG 

ATTGGAAGCT 

720 

TTCAGACAAG 

ATGGAACAGT 

GCAGGCAGGG 

CTGGTATTGA 

GCGGTGACAC 

AGTGGATCAG 

780 

ATTGGGTCAA 

TCATGCGGTC 

TCAACAGAGC 

TTGGTAACTC 

TTATGGTTGA 

AACATTAATA 

840 

ACAATGAATA 

CCAGCAGAAA 

TGACCTCACA 

ACCATAGAAA 

AGAATATACA 

AATTGTTGGC 

900 

AACTACATAA 

GAGATGCAGG 

TCTCGCTTCA 

TTCTTCAATA 

CAATCAGATA 

TGGAATTGAG 

960 

ACCAGAATGG

CAGCTTTGAC 

TCTATCCACT 

CTCAGACCAG 

ATATCAATAG 

ATTAAAAGCT

1020 

TTGATGGAAC 

TGTATTTATC 

AAAGGGACCA 

CGCGCTCCTT 

TCATCTGTAT 

CCTCAGAGAT 

1080 

CCTATACATG 

GTGAGTTCGC 

ACCAGGCAAC 

TATCCTGCCA 

TATGGAGCTA 

TGCAATGGGG 

1140 

GTGGCAGTTG 

TACAAAATAG 

AGCCATGCAA 

CAGTATGTGA 

CGGGAAGATC 

ATATCTAGAC 

1200 

ATTGATATGT 

TCCAGCTAGG 

ACAAGCAGTA 

GCACGTGATG 

CCGAAGCTCA 

AATGAGCTCA 

1260 

ACACTGGAAG 

ATGAACTTGG 

AGTGACACAC 

GAATCTAAAG

AAAGCTTGAA 

GAGACATATA 

1320 

AGGAACATAA 

ACAGTTCAGA 

GACATCTTTC 

CACAAACCGA 

CAGGTGGATC 

AGCCATAGAG

1380 

ATGGCAATAG 

ATGAAGAGCC 

AGAACAATTC 

GAACATAGAG 

CAGATCAAGA 

ACAAAATGGA 

1440 

GAACCTCAAT 

CATCCATAAT

TCAATATGCC 

TGGGCAGAAG 

GAAATAGAAG 

CGATGATCAG 

1500 

ACTGAGCAAG 

CTACAGAATC 

TGACAATATC 

AAGACCGAAC 

AACAAAACAT 

CAGAGACAGA 

1560 

CTAAACAAGA 

GACTCAACGA 

CAAGAAGAAA 

CAAAGCAGTC 

AACCACCCAC 

TAATCCCACA 

1620 

AACAGAACAA 

ACCAGGACGA 

AATAGATGAT 

CTGTTTAACG

CATTTGGAAG 

CAACTAATCG 

1680 

AATCAACATT

TTAATCTAAA 

TCAATAATAA 

ATAAGAAAAA 

CTTAGGATTA 

AAGAATCCTA 

1740 

TCATACCGGA 

ATATAGGGTG 

GTAAATTTAG 

AGTCTGCTTG 

AAACTCAATC 

AATAGAGAGT 

1800 

TGATGGAAAG 

CGATGCTAAA 

AACTATCAAA 

TCATGGATTC 

TTGGGAAGAG 

GAATCAAGAG 

1860 





ATAAATCAAC TAATATCTCC TCGGCCCTCA ACATCATTGA ATTCATACTC AGCACCGACC 1920
CCCAAGAAGA CTTATCGGAA AACGACACAA TCAACACAAG AACCCAGCAA CTCAGTGCCA 1980
CCATCTGTCA ACCAGAAATC AAACCAACAG AAACAAGTGA GAAAGATAGT GGATCAACTG 2040
ACAAAAATAG ACAGTCTGGG TCATCACACG AATGTACAAC AGAAGCAAAA GATAGAAATA 2100
TTGATCAGGA AACTGTACAG AGAGGACCTG GGAGAAGAAG CAGCTCAGAT AGTAGAGCTG 2160
AGACTGTGGT CTCTGGAGGA ATCCCCAGAA GCATCACAGA TTCTAAAAAT GGAACCCAAA 2220
ACACGGAGGA TATTGATCTC AATGAAATTA GAAAGATGGA TAAGGACTCT ATTGAGGGGA 2280
AAATGCGACA ATCTGCAAAT GTTCCAAGCG AGATATCAGG AAGTGATGAC ATATTTACAA 2340
CAGAACAAAG TAGAAACAGT GATCATGGAA GAAGCCTGGA ATCTATCAGT ACACCTGATA 2400
CAAGATCAAT AAGTGTTGTT ACTGCTGCAA CACCAGATGA TGAAGAAGAA ATACTAATGA 2460
AAAATAGTAG GACAAAGAAA AGTTCTTCAA CACATCAAGA AGATGACAAA AGAATTAAAA 2520
AAGGGGGAAA AGGGAAAGAC TGGTTTAAGA AATCAAAAGA TACCGACAAC CAGATACCAA 2580
CATCAGACTA CAGATCCACA TCAAAAGGGC AGAAGAAAAT CTCAAAGACA ACAACCACCA 2640
ACACCGACAC AAAGGGGCAA ACAGAAATAC AGACAGAATC ATCAGAAACA CAATCCTCAT 2700
CATGGAATCT CATCATCGAC AACAACACCG ACCGGAACGA ACAGACAAGC ACAACTCCTC 2760
CAACAACAAC TTCCAGATCA ACTTATACAA AAGAATCGAT CCGAACAAAC TCTGAATCCA 2820
AACCCAAGAC ACAAAAGACA AATGGAAAGG AAAGGAAGGA TACAGAAGAG AGCAATCGAT 2880
TTACAGAGAG GGCAATTACT CTATTGCAGA ATCTTGGTGT AATTCAATCC ACATCAAAAC 2940
TAGATTTATA TCAAGACAAA CGAGTTGTAT GTGTAGCAAA TGTACTAAAC AATGTAGATA 3000
CTGCATCAAA GATAGATTTC CTGGCAGGAT TAGTCATAGG GGTTTCAATG GACAACGACA 3060
CAAAATTAAC ACAGATACAA AATGAAATGC TAAACCTCAA AGCAGATCTA AAGAAAATGG 3120
ACGAATCACA TAGAAGATTG ATAGAAAATC AAAGAGAACA ACTGTCATTG ATCACGTCAC 3180
TAATTTCAAA TCTCAAAATT ATGACTGAGA GAGGAGGAAA GAAAGACCAA AATGAATCCA 3240
ATGAGAGAGT ATCCATGATC AAAACAAAAT TGAAAGAAGA AAAGATCAAG AAGACCAGGT 3300
TTGACCCACT TATGGAGGCA CAAGGCATTG ACAAGAATAT ACCCGATCTA TATCGACATG 3360
CAGGAGATAC ACTAGAGAAC GATGTACAAG TTAAATCAGA GATATTAAGT TCATACAATG 3420
AGTCAAATGC AACAAGACTA ATACCCAAAA AAGTGAGCAG TACAATGAGA TCACTAGTTG 3480
CAGTCATCAA CAACAGCAAT CTCTCACAAA GCACAAAACA ATCATACATA AACGAACTCA 3540
AACGTTGCAA AAATGATGAA GAAGTATCTG AATTAATGGA CATGTTCAAT GAAGATGTCA 3600
ACAATTGCCA ATGATCCAAC AAAGAAACGA CACCGAACAA ACAGACAAGA AACAACAGTA 3660
GATCAAAACC TGTCAACACA CACAAAATCA AGCAGAATGA AACAACAGAT ATCAATCAAT 3720
ATACAAATAA GAAAAACTTA GGATTAAAGA ATAAATTAAT CCTTGTCCAA AATGAGTATA 3780
ACTAACTCTG CAATATACAC ATTCCCAGAA TCATCATTCT CTGAAAATGG TCATATAGAA 3840
CCATTACCAC TCAAAGTCAA TGAACAGAGG AAAGCAGTAC CCCACATTAG AGTTGCCAAG 3900
ATCGGAAATC CACCAAAACA CGGATCCCGG TATTTAGATG TCTTCTTACT CGGCTTCTTC 3960
GAGATGGAAC GAATCAAAGA CAAATACGGG AGTGTGAATG ATCTCGACAG TGACCCGAGT 4020
TACAAAGTTT GTGGCTCTGG ATCATTACCA ATCGGATTGG CTAAGTACAC TGGGAATGAC 4080
CAGGAATTGT TACAAGCCGC AACCAAACTG GATATAGAAG TGAGAAGAAC AGTCAAAGCG 4140
AAAGAGATGG TTGTTTACAC GGTACAAAAT ATAAAACCAG AACTGTACCC ATGGTCCAAT 4200
AGACTAAGAA AAGGAATGCT GTTCGATGCC AACAAAGTTG CTCTTGCTCC TCAATGTCTT 4260
CCACTAGATA GGAGCATAAA ATTTAGAGTA ATCTTCGTGA ATTGTACGGC AATTGGATCA 4320
ATAACCTTGT TCAAAATTCC TAAGTCAATG GCATCACTAT CTCTACCCAA CACAATATCA 4380
ATCAATCTGC AGGTACACAT AAAAACAGGG GTTCAGACTG ATTCTAAAGG GATAGTTCAA 4440
ATTTTGGATG AGAAAGGCGA AAAATCACTG AATTTCATGG TCCATCTCGG ATTGATCAAA 4500
AGAAAAGTAG GCAGAATGTA CTCTGTTGAA TACTGTAAAC AGAAAATCGA GAAAATGAGA 4560
TTGATATTTT CTTTAGGACT AGTTGGAGGA ATCAGTCTTC ATGTCAATGC AACTGGGTCC 4620
ATATCAAAAA CACTAGCAAG TCAGCTGGTA TTCAAAAGAG AGATTTGTTA TCCTTTAATG 4680
GATCTAAATC CGCATCTCAA TCTAGTTATC TGGGCTTCAT CAGTAGAGAT TACAAGAGTG 4740
GATGCAATTT TCCAACCTTC TTTACCTGGC GAGTTCAGAT ACTATCCTAA TATTATTGCA 4800
AAAGGAGTTG GGAAAATCAA ACAATGGAAC TAGTAATCTC TATTTTAGTC CGGACGTATC 4860
TATTAAGCCG AAGCAAATAA AGGATAATCA AAAACTTAGG ACAAAAGAGG TCAATACCAA 4920
CAACTATTAG CAGTCACACT CGCAAGAATA AGAGAGAAGG GACCAAAAAA GTCAAATAGG 4980
AGAAATCAAA ACAAAAGGTA CAGAACACCA GAACAACAAA ATCAAAACAT CCAACTCACT 5040
CAAAACAAAA ATTCCAAAAG AGACCGGCAA CACAACAAGC ACTGAACACA ATGCCAACTT 5100
CAATACTGCT AATTATTACA ACCATGATCA TGGCATCTTT CTGCCAAATA GATATCACAA 5160
AACTACAGCA CGTAGGTGTA TTGGTCAACA GTCCCAAAGG GATGAAGATA TCACAAAACT 5220
TTGAAACAAG ATATCTAATT TTGAGCCTCA TACCAAAAAT AGAAGACTCT AACTCTTGTG 5280
GTGACCAACA GATCAAGCAA TACAAGAAGT TATTGGATAG ACTGATCATC CCTTTATATG 5340
ATGGATTAAG ATTACAGAAA GATGTGATAG TAACCAATCA AGAATCCAAT GAAAACACTG 5400
ATCCCAGAAC AAAACGATTC TTTGGAGGGG TAATTGGAAC CATTGCTCTG GGAGTAGCAA 5460
CCTCAGCACA AATTACAGCG GCAGTTGCTC TGGTTGAAGC CAAGCAGGCA AGATCAGACA 5520
TCGAAAAACT CAAAGAAGCA ATTAGGGACA CAAACAAAGC AGTGCAGTCA GTTCAGAGCT 5580
CCATAGGAAA TTTAATAGTA GCAATTAAAT CAGTCCAGGA TTATGTTAAC AAAGAAATCG 5640
TGCCATCGAT TGCGAGGCTA GGTTGTGAAG CAGCAGGACT TCAATTAGGA ATTGCATTAA 5700
CACAGCATTA CTCAGAATTA ACAAACATAT TTGGTGATAA CATAGGATCG TTACAAGAAA 5760
AAGGAATAAA ATTACAAGGT ATAGCATCAT TATACCGCAC AAATATCACA GAAATATTCA 5820
CAACATCAAC AGTTGATAAA TATGATATCT ATGATCTGTT ATTTACAGAA TCAATAAAGG 5880
TGAGAGTTAT AGATGTTGAC TTGAATGATT ACTCAATCAC CCTCCAAGTC AGACTCCCTT 5940
TATTAACTAG GCTGCTGAAC ACTCAGATCT ACAAAGTAGA TTCCATATCA TATAACATCC 6000
AAAACAGAGA ATGGTATATC CCTCTTCCCA GCCATATCAT GACGAAAGGG GCATTTCTAG 6060



GTGGAGCAGA 

CGTCAAAGAA 

TGTATAGAAG 

CATTCAGCAG 

CTATATATGC 

CCTTCTGATC 

6120 

CAGGATTTGT 

ATTAAACCAT 

GAAATAGAGA 

GCTGCTTATC 

AGGAAACATA 

TCCCAATGTC 

6180 

CAAGAACAAC 

GGTCACATCA 

GACATTGTTC 

CAAGATATGC 

ATTTGTCAAT 

GGAGGAGTGG 

6240 

TTGCAAACTG 

TATAACAACC 

ACCTGTACAT 

GCAACGGAAT 

TGGTAATAGA 

ATCAATCAAC 

6300 

CACCTGATCA 

AGGAGTAAAA 

ATTATAACAC 

ATAAAGAATG 

TAGTACAATA 

GGTATCAACG 

6360 

GAATGCTGTT 

CAATACAAAT 

AAAGAAGGAA 

CTCTTGCATT 

CTATACACCA 

AATGATATAA 

6420 

CACTAAACAA 

TTCTGTTGCA 

CTTGATCCAA 

TTGACATATC 

AATCGAGCTC 

AACAAGGCCA 

6480 

AATCAGATCT 

AGAAGAATCA 

AAAGAATGGA 

TAAGAAGGTC 

AAATCAAAAA 

CTAGATTCTA 

6540 

TTGGAAATTG 

GCATCAATCT 

AGCACTACAA 

TCATAATTAT 

TTTGATAATG 

ATCATTATAT 

6600 

TGTTTATAAT 

TAATATAACG 

ATAATTACAA 

TTGCAATTAA 

GTATTACAGA 

ATTCAAAAGA 

6660 

GAAATCGAGT 

GGATCAAAAT 

GACAAGCCAT

ATGTACTAAC 

AAACAAATAA 

CATATCTACA 

6720 

GATCATTAGA 

TATTAAAATT 

ATAAAAAACT 

TAGGAGTAAA 

GTTACGCAAT 

CCAACTCTAC 

6780 

TCATATAATT 

GAGGAAGGAC 

CCAATAGACA 

AATCCAAATT 

CGAGATGGAA 

TACTGGAAGC 

6840 

ATACCAATCA 

CGGAAAGGAT 

GCTGGTAATG 

AGCTGGAGAC 

GTCTATGGCT 

ACTCATGGCA 

6900 

ACAAGCTCAC 

TAATAAGATA 

ATATACATAT 

TATGGACAAT 

AATCCTGGTG 

TTATTATCAA 

6960 

TAGTCTTCAT 

CATAGTGCTA 

ATTAATTCCA

TCAAAAGTGA 

AAAGGCCCAC 

GAATCATTGC 

7020 

TGCAAGACAT 

AAATAATGAG 

TTTATGGAAA 

TTACAGAAAA 

GATCCAAATG 

GCATCGGATA 

7080 

ATACCAATGA 

TCTAATACAG 

TCAGGAGTGA 

ATACAAGGCT 

TCTTACAATT 

CAGAGTCATG 

7140 

TCCAGAATTA 

CATACCAATA 

TCATTGACAC 

AACAGATGTC 

AGATCTTAGG 

AAATTCATTA 

7200 

GTGAAATTAC 

AATTAGAAAT 

GATAATCAAG 

AAGTGCTGCC 

ACAAAGAATA 

ACACATGATG 

7260 

TAGGTATAAA 

ACCTTTAAAT 

CCAGATGATT

TTTGGAGATG 

CACGTCTGGT 

CTTCCATCTT 

7320 

TAATGAAAAC 

TCCAAAAATA 

AGGTTAATGC 

CAGGGCCGGG 

ATTATTAGCT 

ATGCCAACGA 

7380 

CTGTTGATGG 

CTGTGTTAGA 

ACTCCGTCTT 

TAGTTATAAA 

TGATCTGATT 

TATGCTTATA 

7440 

CCTCAAATCT 

AATTACTCGA 

GGTTGTCAGG 

ATATAGGAAA 

ATCATATCAA 

GTCTTACAGA 

7500 

TAGGGATAAT 

AACTGTAAAC 

TCAGACTTGG 

TACCTGACTT 

AAATCCTAGG 

ATCTCTCATA 

7560 

CCTTTAACAT 

AAATGACAAT 

AGGAAGTCAT 

GTTCTCTAGC 

ACTCCTAAAT 

ATAGATGTAT 

7620 

ATCAACTGTG 

TTCAACTCCC 

AAAGTTGATG 

AAAGATCAGA 

TTATGCATCA 

TCAGGCATAG 

7680 

AAGATATTGT 

ACTTGATATT 

GTCAATTATG 

ATGGTTCAAT 

CTCAACAACA 

AGATTTAAGA 

7740 

ATAATAACAT 

AAGCTTTGAT 

CAACCATATG

CTGCACTATA 

CCCATCTGTT 

GGACCAGGGA 

7800 

TATACTACAA 

AGGCAAAATA 

ATATTTCTCG 

GGTATGGAGG 

TCTTGAACAT 

CCAATAAATG

7860 

AGAATGTAAT 

CTGCAACACA 

ACTGGGTGCC 

CCGGGAAAAC 

ACAGAGAGAC 

TGTAATCAAG 

7920 

CATCTCATAG 

TACTTGGTTT 

TCAGATAGGA 

GGATGGTCAA 

CTCCATCATT 

GTTGTTGACA 

7980 

AAGGCTTAAA 

CTCAATTCCA 

AAATTGAAAG 

TATGGACGAT 

ATCTATGCGA 

CAAAATTACT 

8040 

GGGGGTCAGA 

AGGAAGGTTA 

CTTCTACTAG 

GTAACAAGAT 

CTATATATAT 

ACAAGATCTA 

8100 

CAAGTTGGCA 

TAGCAAGTTA 

CAATTAGGAA 

TAATTGATAT 

TACTGATTAC 

AGTGATATAA 

8160 
GGATAAAATG

GACATGGCAT 

AATGTGCTAT

CAAGACCAGG 

AAACAATGAA 

TGTCCATGGG 

8220 

GACATTCATG 

TCCAGATGGA 

TGTATAACAG 

GAGTATATAC 

TGATGCATAT

CCACTCAATC

8280 

CCACAGGGAG 

CATTGTGTCA 

TCTGTCATAT 

TAGACTCACA 

AAAATCGAGA

GTGAACCCAG

8340 

TCATAACTTA 

CTCAACAGCA 

ACCGAAAGAG

TAAACGAGCT 

GGCCATCCTA 

AACAGAACAC 

8400 

TCTCAGCTGG 

ATATACAACA 

ACAAGCTGCA 

TTACACACTA 

TAACAAAGGA 

TATTGTTTTC 

8460 

ATATAGTAGA 

AATAAATCAT 

AAAAGCTTAA 

ACACATTTCA 

ACCCATGTTG 

TTCAAAACAG 

8520 

AGATTCCAAA 

AAGCTGCAGT 

TAATCATAAT 

TAACCATAAT 

ATGCATCAAT 

CTATCTATAA 

8580 

TACAAGTATA 

TGATAAGTAA 

TCAGCAATCA 

GACAATAGAC 

AAAAGGGAAA 

TATAAAAAAC 

8640 

TTAGGAGCAA 

AGCGTGCTCG 

GGAAATGGAC 

ACTGAATCTA 

ACAATGGCAC 

TGTATCTGAC 

8700 

ATACTCTATC 

CTGAGTGTCA 

CCTTAACTCT 

CCTATCGTTA 

AAGGTAAAAT 

AGCACAATTA 

8760 

CACACTATTA 

TGAGTCTACC 

TCAGCCTTAT 

GATATGGATG 

ACGACTCAAT 

ACTAGTTATC 

8820 

ACTAGACAGA 

AAATAAAACT 

TAATAAATTG 

GATAAAAGAC 

AACGATCTAT 

TAGAAGATTA 

8880 

AAATTAATAT 

TAACTGAAAA 

AGTGAATGAC 

TTAGGAAAAT 

ACACATTTAT 

CAGATATCCA 

8940 

GAAATGTCAA 

AAGAAATGTT 

CAAATTATAT 

ATACCTGGTA 

TTAACAGTAA 

AGTGACTGAA 

9000 

TTATTACTTA 

AAGCAGATAG 

AACATATAGT 

CAAATGACTG 

ATGGATTAAG 

AGATCTATGG 

9060 

ATTAATGTGC 

TATCAAAATT 

AGCCTCAAAA 

AATGATGGAA 

GCAATTATGA 

TCTTAATGAA 

9120 

GAAATTAATA 

ATATATCGAA 

AGTTCACACA 

ACCTATAAAT 

CAGATAAATG 

GTATAATCCA 

9180 

TTCAAAACAT 

GGTTTACTAT 

CAAGTATGAT 

ATGAGAAGAT 

TACAAAAAGC 

TCGAAATGAG 

9240 

ATCACTTTTA 

ATGTTGGGAA 

GGATTATAAC 

TTGTTAGAAG 

ACCAGAAGAA 

TTTCTTATTG 

9300 

ATACATCCAG 

AATTGGTTTT 

GATATTAGAT 

AAACAAAACT 

ATAATGGTTA 

TCTAATTACT 

9360 

CCTGAATTAG 

TATTGATGTA 

TTGTGACGTA 

GTCGAAGGCC 

GATGGAATAT 

AAGTGCATGT 

9420 

GCTAAGTTAG 

ATCCAAAATT 

ACAATCTATG 

TATCAGAAAG 

GTAATAACCT 

GTGGGAAGTG 

9480 

ATAGATAAAT 

TGTTTCCAAT 

TATGGGAGAA 

AAGACATTTG 

ATGTGATATC 

GTTATTAGAA 

9540 

CCACTTGCAT 

TATCCTTAAT 

TCAAACTCAT 

GATCCTGTTA 

AACAACTAAG 

AGGAGCTTTT 

9600 

TTAAATCATG 

TGTTATCCGA 

GATGGAATTA 

ATATTTGAAT 

CTAGAGAATC 

GATTAAGGAA 

9660 

TTTCTGAGTG 

TAGATTACAT 

TGATAAAATT 

TTAGATATAT 

TTAATAAGTC 

TACAATAGAT 

9720 

GAAATAGCAG 

AGATTTTCTC 

TTTTTTTAGA 

ACATTTGGGC 

ATCCTCCATT 

AGAAGCTAGT 

9780 

ATTGCAGCAG 

AAAAGGTTAG 

AAAATATATG 

TATATTGGAA 

AACAATTAAA 

ATTTGACACT 

9840 

ATTAATAAAT 

GTCATGCTAT 

CTTCTGTACA 

ATAATAATTA 

ACGGATATAG 

AGAGAGGCAT 

9900 

GGTGGACAGT 

GGCCTCCTGT 

GACATTACCT 

GATCATGCAC 

ACGAATTCAT 

CATAAATGCT 

9960 

TACGGTTCAA 

ACTCTGCGAT 

ATCATATGAA 

AATGCTGTTG 

ATTATTACCA 

GAGCTTTATA 

10020 

GGAATAAAAT 

TCAATAAATT 

CATAGAGCCT 

CAGTTAGATG 

AGGATTTGAC 

AATTTATATG 

10080 

AAAGATAAAG 

CATTATCTCC 

AAAAAAATCA 

AATTGGGACA 

CAGTTTATCC 

TGCATCTAAT 

10140 

TTACTGTACC 

GTACTAACGC 

ATCCAACGAA 

TCACGAAGAT 

TAGTTGAAGT 

ATTTATAGCA 

10200 

GATAGTAAAT 

TTGATCCTCA 

TCAGATATTG 

GATTATGTAG 

AATCTGGGGA 

CTGGTTAGAT 

10260 



10320 


GATCCAGAAT 
CTCTTTGCAA 
GCAAATAACA 
CTTAAGAGAT 
AATTCTAAAA 
TCTTCTAATC 
TACGAGACTG 
TATGAATCAA 
TTTAATTGGT 
CCTCCATCAG 
CATAACCCAA 
AGTGCAATAC 
GACAATCAAG 
AAGGAGATAG 
GATCTAGGTC 
AGCAAAAGAA 
TGTGTCTTCT 
ACATCATTTG 
ATTTTTAAGA 
ACACAGAATA 
ATACCTGCTA 
ATTGGTGATC 
TTAGACCGAA 
GACTGGGCTT 
ATAAAAAATA 
TTATTCACAA 
AAGGTAATT C 
AATGCCATAG 
GGAGGACTGA 
ACACTAAGTA 
TCGGTAGACC 
ATGATAAGTG 
GGATCAGAAC 
TATTTACCCG 
CCTTATTTTG 


TTAATATTTC 

AAATGACATA 

TAGGAAAATT 

TAACAACCAT 

GCCATACAGA 

AGAAATCAAA 

TGAGCTGTTT 

CAGCTCTATT 

TACACCCTCG 

ATAAAGAACA 

GAGGGGGTAT 

ATCTAGCAGC 

CTATAGCTGT 

TTTATAAAGA 

ATGAACTTAA 

TCTATTATGA 

GGTCAGAGAC 

CAAAAGCAAT 

ATATTCAACA 

TCAGAGATCA 

GTGTTGGGGG 

CATCAGTTGC 

GTGTTCTTTA 

CAGATCCATA 

TAACAGCAAG 

ATACAATGAT 

TCCCTAGAGT 

CTGGAATGTT 

CATATAGTTT 

GGACTTTGCG 

TTGCCATAGC 

GACTTGAAAC 

ATTGTAAAAT 

GTAATATCAA 

GATCAGTCAC 


TTATAGTCTT 

CAAAATGAGA 

CTTTCAAGAA 

ATCAATATCA 

TGACCTTAAA 

GAAATTTGAA 

CCTAACAACA 

TGGAGAAACT 

TCTTGAAGGA 

TATATCATTA 

AGAAGGATTT 

TGTTAGAATA 

AACCACAAGA 

TGTAGTGAGA 

ATTAAATGAA 

TGGGAGAATT 

AGTAATAGAC 

TGAGAATGGT 

ACTATATATT 

GTATTTTAGG 

ATTCAATTAC 

CGCATTGGCT 

TAGGATTATG 

TTCATGCAAT 

GAATGTATTA 

AGAAGAAGAT 

TGCACATGAT 

AGATACGACA 

GTTGAGGAAA 

ACTAATTGTA 

ATTGCGACAA 

GCCTGACCCA 

ATGTTATTCT 

AATAGGATCA 

TGATGAAAGA 


AAAGAAAAAG 

GCTACACAAG 

AATGGGATGG 

GGAGTTCCAC 

ACCTACAATA 

TTCAAGTCAA 

GATCTCAAAA 

TGCAACCAAA 

AGTACAATCT 

GAGGATCACC 

TGTCAAAAAT 

GGCGTGAGGG 

GTACC CAAC A 

TTTTTTGATT 

ACGATTATAA 

CTTCCTCAAG 

GAAACAAGAT 

TATTCACCTG 

GCCCTTGGGA 

AATCCAAATT 

ATGGCCATGT 

GATATTAAAA 

AATCAAGAAC 

TTACCACAAT 

CAAGATTCAC 

GAAGAATTAG 

ATTCTAGATA 

AAATCACTAA 

ATCAGTAATT 

AGTGATAAAA 

AAGATGTGGA 

TTAGAATTAC 

TCAGATGGCA 

GCAGAAACAG 

TCTGAAGCAC 


AGATCAAACA 
TTTTATCAGA 
TGAAGGGAGA 
GGTATAATGA 
AAATAAGTAA 
CGGATATCTA 
AATACTGTCT 
TATTTGGATT 
ATGTAGGTGA 
CTGATTCTGG 
T ATGGAC AC T 
TGACTGCAAT 
ATTATGACTA 
CATTAAGAGA 
GTAGCAAGAT 
CTCTAAAAGC 
CAGCATCTTC 
TTCTAGGATA 
TGAATATCAA 
GGATGCAATA 
CAAGATGTTT 
GATTTATTAA 
CAGGTGAGTC 
CTCAAAATAT 
CAAATCCATT 
CTGAGTTCCT 
ATTCTCTCAC 
TTCGGGTTGG 
ACGATCTAGT 
TCAAGTATGA 
TTCATTTATC 
TATCTGGGGT 
CAAACCCATA 
GTATATCGTC 
AATTAGGATA 


GGAAGGTAGA 
GACCCTACTT 
GATTGAATTA 
AGTGTACAAT 
TCTTAATTTG 
CAATGATGGA 
TAATTGGAGA 
AAATAAATTG 
TCCTTACTGT 
TTTTTACGTT 
CATATCTATA 
GGTTCAAGGA 
CAGAGTTAAG 
AGTGATGGAT 
GTTCATATAT 
ATTATCTAGA 
AAATTTGGCA 
TGCATGCTCA 
TCCAACTATA 
TGCCTCTTTA 
TGTAAGGAAT 
GGCGAATCTA 
ATCTTTTTTG 
AACCACCATG 
ATTATCTGGA 
GATGGACAGG 
AGGAATTAGA 
CATAAATAGA 
ACAATATGAA 
AGATATGTGT 
AGGAGGAAGG 
AGTAATAACA 
TACTTGGATG 
ATTAAGAGTT 
TATCAAGAAT 


10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 





CTTAGTAAAC 

AATGATGAGA 

CTAGATAGTC 

AAGGATACTG 

ATAACAATGT 

CTTATTTATC 

AAAGAAACCA 

ATTAAAGAAA 

TATCCTGAAA 

AAACTTATGG 

GACATCATAC 

TTAGATCGAG 

TTAATCACTG 

TTAGTAAATC 

TGGGATTATA 

AATGCATTAT 

ATTTATGGTC 

TATTCACTAG 

TGTGACAGCG 

TCATTTGTTT 

TACTTGGAGA 

ACTCTTAAAT 

TACGTAAGAA 

ATTGATGATT 

AATGATAACT 

CTTAAGAACT 

AGACTAGATG 

CAATTGAGAT 

ATTTTAATGA 

GCTATGCTAG 

TTGAATATAA 

TTAGTAGGTA 

AATGGGAATC 

GAATTAAATG 

TCAGAAGAAA 


CTGCAAAAGC 

TATCTTGGAT 

TCAAAATTTT 

CAACTCAGAT 

CCAATGATAA 

AACAAATAAT 

CAGGACACAA 

GTTTTAATGA 

GTAATGAATT 

TTATTAAAGA 

ATGCAATTTC 

ATAATTTAAA 

AATTTTTGAC 

AATTTGCATA 

TAATGAGAAC 

CTCATCCTAA 

CTAATACTGC 

ATCTATTTAT 

ATATGGAAGT 

GTTGTTTAGC 

GACTTGATCT 

ATGTACAAAT 

AGACTGCAAT 

GGGATCCGGT 

GTAATAAAGA 

ATCAAGTCCT 

CTAATACAAG 

TATTCGGAAT 

AGGAAGTCAA 

CATGTTATGA 

CAGATGTAAT 

AAAAATTAGG 

CTAATTCAAC 

ATAAGTCCAT 

CTGTTCTACA 


CGCAATAAGA 

GGAAGCCTCA 

AACACCGGTA 

GAAATTCTCC 

CATGTCTATC 

GTTAACAGGA 

CCCTATAGTT 

TGAACATATT 

TATTTATGAT 

CCATTCTTAC 

AATATGTACT 

AGAGATAATA 

TCTTGACATA 

CACTCTTTAT 

ACTGAGAGAT 

AGTATTCAAG 

TAGTCAAGAC 

GAGAGAATGG 

TGCAAATGAT 

AGAAATTGCA 

ATTGAAACAA 

ATCTGGATTA 

CAAATATCTA 

AGAAGATGAA 

TAATAAAGGG 

TAAAATCAGA 

TGGTTTGACA 

CAACAGCACT 

TAAAGACAAG 

TGCCACATTA 

TGGTCAACGA 

AAATGTGACA 

ATGGATAGGA 

TGGATTAGTA 

TGAACATTAT 


ATAGCAATGA 

CAGATAGCAC 

GCTACATCAA 

AGTACATCAT 

AAAGAAGCTA 

TTAAGTGTTT 

ATGCATCTGC 

AATCCAGAGT 

AAAGACCCAC 

ACAATTGATA 

GCAATTACAA 

GTTATTGCAA 

CTTGTATTTC 

AGTCTAAAAA 

ACTTCCCATT 

AGGTTCTGGG 

CAGATAAAAC 

TTGAATGGTG 

AGGAAACAAG 

TCTTTCGGAC 

TATCTTGAAT 

TTAATTAAAT 

AGGATTCGCG 

AATATGCTGG 

AATAAAATTA 

TCTATAACAA 

CTTCCTCAAG 

AGTTGTCTGA 

GACAGGCTCT 

GGACCTGCAG 

GAATTGAAAA 

CAGATTCTTA 

AATATGGAAT 

CATTGTGATA 

AGTGTTATAA 


TATATACATG 
AAACACGTGC 
CAAATTTATC 
TGATCAGAGT 
ATGAAACCAA 
TCGAATATTT 
ACATAGAAGA 
CTACATTAGA 
TCAAAGATGT 
TGAATTATTG 
TAGCAGATAC 
ATGATGATGA 
TCAAGACATT 
TAGAAGGTAG 
CAATATTAAA 
ATTGTGGAGT 
TTGCCCTATC 
TATCACTTGA 
CCTTTATTTC 
CTAACCTGTT 
TAAATATTAA 
CGTTCCCATC 
GTATTAGTCC 
ATAACATTGT 
ACAATTTCTG 
GTGATTCTGA 
GAGGGAATTA 
AAGCTCTTGA 
TCCTGGGAGA 
TTAATTATTA 
TATTTCCTTC 
ACAGGGTAAA 
GTGAGAGCTT 
TGGAAGGAGC 
GAATTACATA 


GGCATTTGGT 

AAATTTTACA 

ACACAGATTA 

CAGCAGATTC 

AGATACTAAT 

ATTTAGATTA 

TGAGTGTTGT 

ATTAATTCGA 

GGACTTATCA 

GGATGATACT 

TATGTCACAA 

TATTAATAGC 

TGGTGGATTA 

GGATCTCATT 

AGTATTATCT 

TTTAAACCCT 

TATATGTGAA 

AATATACATT 

TAGACACCTT 

AAACTTAACA 

AGAAGACCCT 

AACTGTAACA 

ACCTGAGGTA 

CAAAACTATA 

GGGACTAGCA 

TGATAATGAT 

TCTATCGCAT 

GTTATCACAA 

AGGAGCAGGA 

TAATTCAGGT 

AGAGGTATCA 

AGTACTGTTC 

AATATGGAGT 

TATCGGTAAA 

CTTGATTGGG 


12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 


14460 



GATGATGATG 

TTGTTTTAGT 

TTCCAAAATT 

ATACCTACAA 

TCACTCCGAA 

TTGGTCTAGA 

14520 

ATACTTTATC 

TATATAAATT 

ATATTGGAAA 

GATGTAAGTA 

TAATATCACT 

CAAAACTTCT 

14580 

AATCCTGCAT 

CAACAGAATT 

ATATCTAATT 

TCGAAAGATG 

CATATTGTAC 

TATAATGGAA 

14640 

CCTAGTGAAA 

TTGTTTTATC 

AAAACTTAAA 

AGATTGTCAC 

TCTTGGAAGA 

AAATAATCTA 

14700 

TTAAAATGGA 

TCATTTTATC 

AAAGAAGAGG 

AATAATGAAT 

GGTTACATCA 

TGAAATCAAA 

14760 

GAAGGAGAAA 

GAGATTATGG 

AATCATGAGA 

CCATATCATA 

TGGCACTACA 

AATCTTTGGA 

14820 

TTTCAAATCA 

ATTTAAATCA 

TCTGGCGAAA 

GAATTTTTAT 

CAACCCCAGA 

TCTGACTAAT 

14880 

ATCAACAATA 

TAATCCAAAG 

TTTTCAGCGA 

ACAATAAAGG 

ATGTTTTATT 

TGAATGGATT 

14940 

AATATAACTC 

ATGATGATAA 

GAGACATAAA 

TTAGGCGGAA 

GATATAACAT 

ATTCCCACTG 

15000 

AAAAATAAGG 

GAAAGTTAAG 

ACTGCTATCG 

AGAAGACTAG 

TATTAAGTTG 

GATTTCATTA 

15060 

TCATTATCGA 

CTCGATTACT 

TACAGGTCGC 

TTTCCTGATG 

AAAAATTTGA 

ACATAGAGCA 

15120 

CAGACTGGAT 

ATGTATCATT 

AGCTGATACT 

GATTTAGAAT 

CATTAAAGTT 

ATTGTCGAAA 

15180 

AACATCATTA 

AGAATTACAG 

AGAGTGTATA 

GGATCAATAT 

CATATTGGTT 

TCTAACCAAA 

15240 

GAAGTTAAAA 

TACTTATGAA 

ATTGATCGGT 

GGTGCTAAAT 

TATTAGGAAT 

TCCCAGACAA 

15300 

TATAAAGAAC 

CCGAAGACCA 

GTTATTAGAA 

AACTACAATC 

AACATGATGA 

ATTTGATATC 

15360 

GATTAAAACA 

TAAATACAAT 

GAAGATATAT 

CCTAACCTTT 

ATCTTTAAGC 

CTAGGAATAG 

15420 

ACAAAAAGTA 

AGAAAAACAT 

GTAATATATA 

TATACCAAAC 

AGAGTTCTTC 

TCTTGTTTGG 

15480 

TGGGTCGGCA 

TGGCATCTCC 

ACCTCCTCGC 

GGTCCGGACC 

TGGGCATCCG 

AAGGAGGACG 

15540 

CACGTCCACT 

CGGATGGCTA 

AGGGAGAGCC 

TGCAGTAGCA 

TAACCCCTTG 

GGGCCTCTAA 

15600 

ACGGGTCTTG 

AGGGGTTTTT 

TGCTGAAAGG 

AGGAACTATA 

TACGCGTCGA 

CGGGCCCCGC 

15660 

GCTCAC 






15666 


(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GGATTTGCGC GCAATTTAAA TCATCTGG 28 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCCAGGTCGG ACCGCGAGGA GGTGGAGATG CCATGCCAGC CCACCAAAAC AAGAGAAGAA 60 

CTCTGTTTGG 70 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

GGCCCGTCGA CGCGTAATAC GACTCACTAT AGGACCAAAC AAGAG 45 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

CGGCATCACG TGCTAC 16 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATGTTGAGCC TATTTGATAC ATTTAATGCA CGTAGGCAAG AAAACATAAC AAAATCAGCC 60 
GGTGGAGCTA TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA 120 
ATAACTGATG ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT 180 
AATGAGAAAC AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT 240 
GCCAATCCAG AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGTCAA GTATGTCATA 300 
TACATGATTG AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA 360 
GAGATGATAT ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG 420 
GAAACTATGT TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT 480 





GGGTATCCAT 

CATGTTTAGG 

AGCTCTTATA 

ATACAGATCT 

GGATAGTTCT 

GGTCAAAGCT 

540 

ATCACTAGTA 

TCTCAGGGTT 

AAGAAAAGGC 

TTTTTCACCC 

GATTGGAAGC 

TTTCAGACAA 

600 

GATGGAACAG 

TGCAGGCAGG 

GCTGGTATTG 

AGCGGTGACA 

CAGTGGATCA 

GATTGGGTCA 

660 

ATCATGCGGT 

CTCAACAGAG 

CTTGGTAACT 

CTTATGGTTG 

AAACATTAAT 

AACAATGAAT 

720 

ACCAGCAGAA 

ATGACCTCAC 

AACCATAGAA 

AAGAATATAC 

AAATTGTTGG 

CAACTACATA 

780 

AGAGATGCAG 

GTCTCGCTTC 

ATTCTTCAAT 

ACAATCAGAT 

ATGGAATTGA 

GACCAGAATG 

840 

GCAGCTTTGA 

CTCTATCCAC 

TCTCAGACCA 

GATATCAATA 

GATTAAAAGC 

TTTGATGGAA 

900 

CTGTATTTAT 

CAAAGGGACC 

ACGCGCTCCT 

TTCATCTGTA 

TCCTCAGAGA 

TCCTATACAT 

960 

GGTGAGTTCG 

CACCAGGCAA 

CTATCCTGCC 

ATATGGAGCT 

ATGCAATGGG 

GGTGGCAGTT 

1020 

GTACAAAATA 

GAGCCATGCA 

ACAGTATGTG 

ACGGGAAGAT 

CATATCTAGA 

CATTGATATG 

1080 

TTCCAGCTAG 

GACAAGCAGT 

AGCACGTGAT 

GCCGAAGCTC 

AAATGAGCTC 

AACACTGGAA 

1140 

GATGAACTTG 

GAGTGACACA 

CGAATCTAAA 

GAAAGCTTGA 

AGAGACATAT 

AAGGAACATA 

1200 

AACAGTTCAG 

AGACATCTTT 

CCACAAACCG 

ACAGGTGGAT 

CAGCCATAGA 

GATGGCAATA 

1260 

GATGAAGAGC 

CAGAACAATT 

CGAACATAGA 

GCAGATCAAG 

AACAAAATGG 

AGAACCTCAA 

1320 

TCATCCATAA 

TTCAATATGC 

CTGGGCAGAA 

GGAAATAGAA 

GCGATGATCA 

GACTGAGCAA 

1380 

GCTACAGAAT 

CTGACAATAT 

CAAGACCGAA 

CAACAAAACA 

TCAGAGACAG 

ACTAAACAAG 

1440 

AGACTCAACG 

ACAAGAAGAA 

ACAAAGCAGT 

CAACCACCCA 

CTAATCCCAC 

AAACAGAACA 

1500 

AACCAGGACG 

AAATAGATGA 

TCTGTTTAAC 

GCATTTGGAA 

GCAACTAAGT 

CGACGATCCG 

1560 

GCTGCTAACA 

AAGCCCGAAA 

GGAAGCTGAG 

TTGGCTGCTG 

CCACCGCTGA 

GCAATAACTA 

1620 

GCATAACCCC 

TTGGGGCCTC 

TAAACGGGTC 

TTGAGGGGTT 

TTTTGCTGAA 

AGGAGGAACT 

1680 

ATATCCGGAT 

CGAGATCAAT 

TCTGTGAGCG 

TATGGCAAAC 

GAAGGAAAAA 

TAGTTATAGT 

1740 

AGCCGCACTC 

GATGGGACAT 

TTCAACGTAA 

ACCGTTTAAT 

AATATTTTGA 

ATCTTATTCC 

1800 

ATTATCTGAA 

ATGGTGGTAA 

AACTAACTGC 

TGTGTGTATG 

AAATGCTTTA 

AGGAGGCTTC 

1860 

CTTTTCTAAA 

CGATTGGGTG 

AGGAAACCGA 

GATAGAAATA 

ATAGGAGGTA 

ATGATATGTA 

1920 

TCAATCGGTG 

TGTAGAAAGT 

GTTACATCGA 

CTCATAATAT 

TATATTTTTT 

ATCTAAAAAA 

1980 

CTAAAAATAA 

ACATTGATTA 

AATTTTAATA 

TAATACTTAA 

AAATGGATGT 

TGTGTCGTTA 

2040 

GATAAACCGT 

TTATGTATTT 

TGAGGAAATT 

GATAATGAGT 

TAGATTACGA 

ACCAGAAAGT 

2100 

GCAAATGAGG 

TCGCAAAAAA 

ACTGCCGTAT 

CAAGGACAGT 

TAAAACTATT 

ACTAGGAGAA 

2160 

TTATTTTTTC 

TTAGTAAGTT 

ACAGCGACAC 

GGTATATTAG 

ATGGTGCCAC 

CGTAGTGTAT 

2220 

ATAGGATCTG 

CTCCCGGTAC 

ACATATACGT 

TATTTGAGAG 

ATCATTTCTA 

TAATTTAGGA 

2280 

GTGATCATCA 

AATGGATGCT 

AATTGACGGC 

CGCCATCATG 

ATCCTATTTT 

AAATGGATTG 

2340 

CGTGATGTGA 

CTCTAGTGAC 

TCGGTTCGTT 

GATGAGGAAT 

ATCTACGATC 

CATCAAAAAA 

2400 

CAACTGCATC 

CTTCTAAGAT 

TATTTTAATT 

TCTGATGTGA 

GATCCAAACG 

AGGAGGAAAT 

2460 

GAACCTAGTA 

CGGCGGATTT 

ACTAAGTAAT 

TACGCTCTAC 

AAAATGTCAT 

GATTAGTATT 

2520 

TTAAACCCCG 

TGGCGTCTAG 

TCTTAAATGG 

AGATGCCCGT 

TTCCAGATCA 

ATGGATCAAG 

2580 





GACTTTTATA 

TCCCACACGG 

TAATAAAATG 

TTACAACCTT 

TTGCTCCTTC 

ATATTCAGGG 

2640 

CCGTCGTTTT 

ACAACGTCGT 

GACTGGGAAA 

ACCCTGGCGT 

TACCCAACTT 

AATCGCCTTG 

2700 

CAGCACATCC 

CCCTTTCGCC 

AGCTGGCGTA 

ATAGCGAAGA 

GGCCCGCACC 

GATCGCCCTT 

2760 

CCCAACAGTT 

GCGCAGCCTG 

AATGGCGAAT 

GGCGCGACGC 

GCCCTGTAGC 

GGCGCATTAA 

2820 

GCGCGGCGGG 

TGTGGTGGTT 

ACGCGCAGCG 

TGACCGCTAC 

ACTTGCCAGC 

GCCCTAGCGC 

2880 

CCGCTCCTTT 

CGCTTTCTTC 

CCTTCCTTTC 

TCGCCACGTT 

CGCCGGCTTT 

CCCCGTCAAG 

2940 

CTCTAAATCG 

GGGGCTCCCT 

TTAGGGTTCC 

GATTTAGTGC 

TTTACGGCAC 

CTCGACCCCA 

3000 

AAAAACTTGA 

TTAGGGTGAT 

GGTTCACGTA 

GTGGGCCATC 

GCCCTGATAG 

ACGGTTTTTC 

3060 

GCCCTTTGAC 

GTTGGAGTCC 

ACGTTCTTTA 

ATAGTGGACT 

CTTGTTCCAA 

ACTGGAACAA 

3120 

CACTCAACCC 

TATCTCGGTC 

TATTCTTTTG 

ATTTATAAGG 

GATTTTGCCG 

ATTTCGGCCT 

3180 

ATTGGTTAAA 

AAATGAGCTG 

ATTTAACAAA 

AATTTAACGC 

GAATTTTAAC 

AAAATATTAA 

3240 

CGTTTACAAT 

TTCCCAGGTG 

GCACTTTTCG 

GGGAAATGTG 

CGCGGAACCC 

CTATTTGTTT 

3300 

ATTTTTCTAA 

ATACATTCAA 

ATATGTATCC 

GCTCATGAGA 

CAATAACCCT 

GATAAATGCT 

3360 

TCAATAATAT 

TGAAAAAGGA 

AGAGTATGAG 

TATTCAACAT 

TTCCGTGTCG 

CCCTTATTCC 

3420 

CTTTTTTGCG 

GCATTTTGCC 

TTCCTGTTTT 

TGCTCACCCA 

GAAACGCTGG 

TGAAAGTAAA 

3480 

AGATGCTGAA 

GATCAGTTGG 

GTGCACGAGT 

GGGTTACATC 

GAACTGGATC 

TCAACAGCGG 

3540 

TAAGATCCTT 

GAGAGTTTTC 

GCCCCGAAGA 

ACGTTTTCCA 

ATGATGAGCA 

CTTTTAAAGT 

3600 

TCTGCTATGT 

GGCGCGGTAT 

TATCCCGTAT 

TGACGCCGGG 

CAAGAGCAAC 

TCGGTCGCCG 

3660 

CATACACTAT 

TCTCAGAATG 

ACTTGGTTGA 

GTACTCACCA 

GTCACAGAAA 

AGCATCTTAC 

3720 

GGATGGCATG 

ACAGTAAGAG 

AATTATGCAG 

TGCTGCCATA 

ACCATGAGTG 

ATAACACTGC 

3780 

GGCCAACTTA 

CTTCTGACAA 

CGATCGGAGG 

ACCGAAGGAG 

CTAACCGCTT 

TTTTGCACAA 

3840 

CATGGGGGAT 

CATGTAACTC 

GCCTTGATCG 

TTGGGAACCG 

GAGCTGAATG 

AAGCCATACC 

3900 

AAACGACGAG 

CGTGACACCA 

CGATGCCTGT 

AGCAATGGCA 

ACAACGTTGC 

GCAAACTATT 

3960 

AACTGGCGAA 

CTACTTACTC 

TAGCTTCCCG 

GCAACAATTA 

ATAGACTGGA 

TGGAGGCGGA 

4020 

TAAAGTTGCA 

GGACCACTTC 

TGCGCTCGGC 

CCTTCCGGCT 

GGCTGGTTTA 

TTGCTGATAA 

4080 

ATCTGGAGCC 

GGTGAGCGTG 

GGTCTCGCGG 

TATCATTGCA 

GCACTGGGGC 

CAGATGGTAA 

4140 

GCCCTCCCGT 

ATCGTAGTTA 

TCTACACGAC 

GGGGAGTCAG 

GCAACTATGG 

ATGAACGAAA 

4200 

TAGACAGATC 

GCTGAGATAG 

GTGCCTCACT 

GATTAAGCAT 

TGGTAACTGT 

CAGACCAAGT 

4260 

TTACTCATAT 

ATACTTTAGA 

TTGATTTAAA 

ACTTCATTTT 

TAATTTAAAA 

GGATCTAGGT 

4320 

GAAGATCCTT 

TTTGATAATC 

TCATGACCAA 

AATCCCTTAA 

CGTGAGTTTT 

CGTTCCACTG 

4380 

AGCGTCAGAC 

CCCGTAGAAA 

AGATCAAAGG 

ATCTTCTTGA 

GATCCTTTTT 

TTCTGCGCGT 

4440 

AATCTGCTGC 

TTGCAAACAA 

AAAAACCACC 

GCTACCAGCG 

GTGGTTTGTT 

TGCCGGATCA 

4500 

AGAGCTACCA 

ACTCTTTTTC 

CGAAGGTAAC 

TGGCTTCAGC 

AGAGCGCAGA 

TACCAAATAC 

4560 

TGTCCTTCTA 

GTGTAGCCGT 

AGTTAGGCCA 

CCACTTCAAG 

AACTCTGTAG 

CACCGCCTAC 

4620 

ATACCTCGCT 

CTGCTAATCC 

TGTTACCAGT 

GGCTGCTGCC 

AGTGGCGATA 

AGTCGTGTCT 

4680 
TACCGGGTTG 

GACTCAAGAC 

GATAGTTACC 

GGATAAGGCG 

CAGCGGTCGG 

GCTGAACGGG 

4740 

GGGTTCGTGC 

ACACAGCCCA 

GCTTGGAGCG 

AACGACCTAC 

ACCGAACTGA 

GATACCTACA 

4800 

GCGTGAGCTA 

TGAGAAAGCG 

CCACGCTTCC 

CGAAGGGAGA 

AAGGCGGACA 

GGTATCCGGT 

4860 

AAGCGGCAGG 

GTCGGAACAG 

GAGAGCGCAC 

GAGGGAGCTT 

CCAGGGGGAA 

ACGCCTGGTA 

4920 

TCTTTATAGT 

CCTGTCGGGT 

TTCGCCACCT 

CTGACTTGAG 

CGTCGATTTT 

TGTGATGCTC 

4980 

GTCAGGGGGG 

CGGAGCCTAT 

GGAAAAACGC 

CAGCAACGCG 

GCCTTTTTAC 

GGTTCCTGGC 

5040 

CTTTTGCTGG 

CCTTTTGCTC 

ACATGTTCTT 

TCCTGCGTTA 

TCCCCTGATT 

CTGTGGATAA 

5100 

CCGTATTACC 

GCCTTTGAGT 

GAGCTGATAC 

CGCTCGCCGC 

AGCCGAACGA 

CCGAGCGCAG 

5160 

CGAGTCAGTG 

AGCGAGGAAG 

CGGAAGAGCG 

CCCAATACGC 

AAACCGCCTC 

TCCCCGCGCG 

5220 

TTGGCCGATT 

CATTAATGCA 

GCTGGCACGA 

CAGGTTTCCC 

GACTGGAAAG 

CGGGCAGTGA 

5280 

GCGCAACGCA 

ATTAATGTGA 

GTTAGCTCAC 

TCATTAGGCA 

CCCCAGGCTT 

TACACTTTAT 

5340 

GCTTCCGGCT 

CGTATGTTGT 

GTGGAATTGT 

GAGCGGATAA 

CAATTTCACA 

CAGGAAACAG 

5400 

CTATGACCAT 

GATTACGCCA 

AGCTTTTGCG 

ATCAATAAAT 

GGATCACAAC 

CAGTATCTCT 

5460 

TAACGATGTT 

CTTCGCAGAT 

GATGATTCAT 

TTTTTAAGTA 

TTTGGCTAGT 

CAAGATGATG 

5520 

AATCTTCATT 

ATCTGATATA 

TTGCAAATCA 

CTCAATATCT 

AGACTTTCTG 

TTATTATTAT 

5580 

TGATCCAATC 

AAAAAATAAA 

TTAGAAGCCG 

TGGGTCATTG 

TTATGAATCT 

CTTTCAGAGG 

5640 

AATACAGACA 

ATTGACAAAA 

TTCACAGACT 

TTCAAGATTT 

TAAAAAACTG 

TTTAACAAGG 

5700 

TCCCTATTGT 

TACAGATGGA 

AGGGTCAAAC 

TTAATAAAGG 

ATATTTGTTC 

GACTTTGTGA 

5760 

TTAGTTTGAT 

GCGATTCAAA 

AAAGAATCCT 

CTCTAGCTAC 

CACCGCAATA 

GATCCTGTTA 

5820 

GATACATAGA 

TCCTCGTCGC 

AATATCGCAT 

TTTCTAACGT 

GATGGATATA 

TTAAAGTCGA 

5880 

ATAAAGTGAA 

CAATAATTAA 

TTCTTTATTG 

TCATCATGAA 

CGGCGGACAT 

ATTCAGTTGA 

5940 

TAATCGGCCC 

CATGTTTTCA 

GGTAAAAGTA 

CAGAATTAAT 

TAGACGAGTT 

AGACGTTATC 

6000 

AAATAGCTCA 

ATATAAATGC 

GTGACTATAA 

AATATTCTAA 

CGATAATAGA 

TACGGAACGG 

6060 

GACTATGGAC 

GCATGATAAG 

AATAATTTTG 

AAGCATTGGA 

AGCAACTAAA 

CTATGTGATG 

6120 

TCTTGGAATC 

AATTACAGAT 

TTCTCCGTGA 

TAGGTATCGA 

TGAAGGACAG 

TTCTTTCCAG 

6180 

ACATTGTTGA 

ATTGATCTCG 

ATCCCGCGAA 

ATTAATACGA 

CTCACTATAG 

GGAGACCACA 

6240 

ACGGTTTCCC 

TCTAGCGGGA 

TCAATTCCGC 

CCCTCTCCCT 

CCCCCCCCCC 

TAACGTTACT 

6300 

GGCCGAAGCC 

GCTTGGAATA 

AGGCCGGTGT 

GCGTTTGTCT 

ATATGTTATT 

TTCCACCATA 

6360 

TTGCCGTCTT 

TTGGCAATGT 

GAGGGCCCGG 

AAACCTGGCC 

CTGTCTTCTT 

GACGAGCATT 

6420 

CCTAGGGGTC 

TTTCCCCTCT 

CGCCAAAGGA 

ATGCAAGGTC 

TGTTGAATGT 

CGTGAAGGAA 

6480 

GCAGTTCCTC 

TGGAAGCTTC 

TTGAAGACAA 

ACAACGTCTG 

TAGCGACCCT 

TTGCAGGCAG 

6540 

CGGAACCCCC 

CACCTGGCGA 

CAGGTGCCTC 

TGCGGCCAAA 

AGCCACGTGT 

ATAAGATACA 

6600 

CCTGCAAAGG 

CGGCACAACC 

CCAGTGCCAC 

GTTGTGAGTT 

GGATAGTTGT 

GGAAAGAGTC 

6660 

AAATGGCTCT 

CCTCAAGCGT 

ATTCAACAAG 

GGGCTGAAGG 

ATGCCCAGAA 

GGTACCCCAT 

6720 

TGTATGGGAT 

CTGATCTGGG 

GCCTCGGTGC 

ACATGCTTTA 

CATGTGTTTA 

GTCGAGGTTA 

6780 
AAAAACGTCT AGGCCCCCCG AACCACGGGG ACGTGGTTTT CCTTTGAAAA ACACGATAAT 6840 
ACC 6843 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 


ATGGAAAGCG 

ACGCTAAAAA 

CTATCAAATC 

ATGGATTCTT 

GGGAAGAGGA 

ATCAAGAGAT 

60 

AAATCAACTA 

ATATCTCCTC 

GGCCCTCAAC 

ATCATTGAAT 

TCATACTCAG 

CACCGACCCC 

120 

CAAGAAGACT 

TATCGGAAAA 

CGACACAATC 

AACACAAGAA 

CCCAGCAACT 

CAGTGCCACC 

180 

ATCTGTCAAC 

CAGAAATCAA 

ACCAACAGAA 

ACAAGTGAGA 

AAGATAGTGG 

ATCAACTGAC 

240 

AAAAATAGAC 

AGTCCGGGTC 

ATCACACGAA 

TGTACAACAG 

AAGCAAAAGA 

TAGAAATATT 

300 

GATCAGGAAA 

CTGTACAGAG 

AGGACCTGGG 

AGAAGAAGCA 

GCTCAGATAG 

TAGAGCTGAG 

360 

ACTGTGGTCT 

CTGGAGGAAT 

CCCCAGAAGC 

ATCACAGATT 

CTAAAAATGG 

AACCCAAAAC 

420 

ACGGAGGATA 

TTGATCTCAA 

TGAAATTAGA 

AAGATGGATA 

AGGACTCTAT 

TGAGGGGAAA 

480 

ATGCGACAAT 

CTGCAAATGT 

TCCAAGCGAG 

ATATCAGGAA 

GTGATGACAT 

ATTTACAACA 

540 

GAACAAAGTA 

GAAACAGTGA 

TCATGGAAGA 

AGCCTGGAAT 

CTATCAGTAC 

ACCTGATACA 

600 

AGATCAATAA 

GTGTTGTTAC 

TGCTGCAACA 

CCAGATGATG 

AAGAAGAAAT 

ACTAATGAAA 

660 

AATAGTAGGA 

CAAAGAAAAG 

TTCTTCAACA 

CATCAAGAAG 

ATGACAAAAG 

AATTAAAAAA 

720 

GGGGGAAAAG 

GGAAAGACTG 

GTTTAAGAAA 

TCAAAAGATA 

CCGACAACCA 

GATACCAACA 

780 

TCAGACTACA 

GATCCACATC 

AAAAGGGCAG 

AAGAAAATCT 

CAAAGACAAC 

AACCACCAAC 

840 

ACCGACACAA 

AGGGGCAAAC 

AGAAATACAG 

ACAGAATCAT 

CAGAAACACA 

ATCCTCATCA 

900 

TGGAATCTCA 

TCATCGACAA 

CAACACCGAC 

CGGAACGAAC 

AGACAAGCAC 

AACTCCTCCA 

960 

ACAACAACTT 

CCAGATCAAC 

TTATACAAAA 

GAATCGATCC 

GAACAAACTC 

TGAATCCAAA 

1020 

CCCAAGACAC 

AAAAGACAAA 

TGGAAAGGAA 

AGGAAGGATA 

CAGAAGAGAG 

CAATCGATTT 

1080 

ACAGAGAGGG 

CAATTACTCT 

ATTGCAGAAT 

CTTGGTGTAA 

TTCAATCCAC 

ATCAAAACTA 

1140 

GATTTATATC 

AAGACAAACG 

AGTTGTATGT 

GTAGCAAATG 

TACTAAACAA 

TGTAGATACT 

1200 

GCATCAAAGA 

TAGATTTCCT 

GGCAGGATTA 

GTCATAGGGG 

TTTCAATGGA 

CAACGACACA 

1260 

AAATTAACAC 

AGATACAAAA 

TGAAATGCTA 

AACCTCAAAG 

CAGATCTAAA 

GAAAATGGAC 

1320 

GAATCACATA 

GAAGATTGAT 

AGAAAATCAA 

AGAGAACAAC 

TGTCATTGAT 

CACGTCACTA 

1380 

ATTTCAAATC 

TCAAAATTAT 

GACTGAGAGA 

GGAGGAAAGA 

AAGACCAAAA 

TGAATCCAAT 

1440 

GAGAGAGTAT 

CCATGATCAA 

AACAAAATTG 

AAAGAAGAAA 

AGATCAAGAA 

GACCAGGTTT 

1500 

GACCCACTTA 

TGGAGGCACA 

AGGCATTGAC 

AAGAATATAC 

CCGATCTATA 

TCGACATGCA 

1560 
GGAGATACAC 

TAGAGAACGA 

TGTACAAGTT 

AAATCAGAGA 

TATTAAGTTC 

ATACAATGAG 

1620 

TCAAATGCAA 

CAAGACTAAT 

ACCCAAAAAA 

GTGAGCAGTA 

CAATGAGATC 

ACTAGTTGCA 

1680 

GTCATCAACA 

ACAGCAATCT 

CTCACAAAGC 

ACAAAACAAT 

CATACATAAA 

CGAACTCAAA 

1740 

CGTTGCAAAA 

ATGATGAAGA 

AGTATCTGAA 

TTAATGGACA 

TGTTCAATGA 

AGATGTCAAC 

1800 

AATTGCCAAT 

GAGTCGACGA 

TCCGGCTGCT 

AACAAAGCCC 

GAAAGGAAGC 

TGAGTTGGCT 

1860 

GCTGCCACCG 

CTGAGCAATA 

ACTAGCATAA 

CCCCTTGGGG 

CCTCTAAACG 

GGTCTTGAGG 

1920 

GGTTTTTTGC 

TGAAAGGAGG 

AACTATATCC 

GGATCGAGAT 

CAATTCTGTG 

AGCGTATGGC 

1980 

AAACGAAGGA 

AAAATAGTTA 

TAGTAGCCGC 

ACTCGATGGG 

ACATTTCAAC 

GTAAACCGTT 

2040 

TAATAATATT 

TTGAATCTTA 

TTCCATTATC 

TGAAATGGTG 

GTAAAACTAA 

CTGCTGTGTG 

2100 

TATGAAATGC 

TTTAAGGAGG 

CTTCCTTTTC 

TAAACGATTG 

GGTGAGGAAA 

CCGAGATAGA 

2160 

AATAATAGGA 

GGTAATGATA 

TGTATCAATC 

GGTGTGTAGA 

AAGTGTTACA 

TCGACTCATA 

2220 

ATATTATATT 

TTTTATCTAA 

AAAACTAAAA 

ATAAACATTG 

ATTAAATTTT 

AATATAATAC 

2280 

TTAAAAATGG 

ATGTTGTGTC 

GTTAGATAAA 

CCGTTTATGT 

ATTTTGAGGA 

AATTGATAAT 

2340 

GAGTTAGATT 

ACGAACCAGA 

AAGTGCAAAT 

GAGGTCGCAA 

AAAAACTGCC 

GTATCAAGGA 

2400 

CAGTTAAAAC 

TATTACTAGG 

AGAATTATTT 

TTTCTTAGTA 

AGTTACAGCG 

ACACGGTATA 

2460 

TTAGATGGTG 

CCACCGTAGT 

GTATATAGGA 

TCTGCTCCCG 

GTACACATAT 

ACGTTATTTG 

2520 

AGAGATCATT 

TCTATAATTT 

AGGAGTGATC 

ATCAAATGGA 

TGCTAATTGA 

CGGCCGCCAT 

2580 

CATGATCCTA 

TTTTAAATGG 

ATTGCGTGAT 

GTGACTCTAG 

TGACTCGGTT 

CGTTGATGAG 

2640 

GAATATCTAC 

GATCCATCAA 

AAAACAACTG 

CATCCTTCTA 

AGATTATTTT 

AATTTCTGAT 

2700 

GTGAGATCCA 

AACGAGGAGG 

AAATGAACCT 

AGTACGGCGG 

ATTTACTAAG 

TAATTACGCT 

2760 

CTACAAAATG 

TCATGATTAG 

TATTTTAAAC 

CCCGTGGCGT 

CTAGTCTTAA 

ATGGAGATGC 

2820 

CCGTTTCCAG 

ATCAATGGAT 

CAAGGACTTT 

TATATCCCAC 

ACGGTAATAA 

AATGTTACAA 

2880 

CCTTTTGCTC 

CTTCATATTC 

AGGGCCGTCG 

TTTTACAACG 

TCGTGACTGG 

GAAAACCCTG 

2940 

GCGTTACCCA 

ACTTAATCGC 

CTTGCAGCAC 

ATCCCCCTTT 

CGCCAGCTGG 

CGTAATAGCG 

3000 

AAGAGGCCCG 

CACCGATCGC 

CCTTCCCAAC 

AGTTGCGCAG 

CCTGAATGGC 

GAATGGCGCG 

3060 

ACGCGCCCTG 

TAGCGGCGCA 

TTAAGCGCGG 

CGGGTGTGGT 

GGTTACGCGC 

AGCGTGACCG 

3120 

CTACACTTGC 

CAGCGCCCTA 

GCGCCCGCTC 

CTTTCGCTTT 

CTTCCCTTCC 

TTTCTCGCCA 

3180 

CGTTCGCCGG 

CTTTCCCCGT 

CAAGCTCTAA 

ATCGGGGGCT 

CCCTTTAGGG 

TTCCGATTTA 

3240 

GTGCTTTACG 

GCACCTCGAC 

CCCAAAAAAC 

TTGATTAGGG 

TGATGGTTCA 

CGTAGTGGGC 

3300 

CATCGCCCTG 

ATAGACGGTT 

TTTCGCCCTT 

TGACGTTGGA 

GTCCACGTTC 

TTTAATAGTG 

3360 

GACTCTTGTT 

CCAAACTGGA 

ACAACACTCA 

ACCCTATCTC 

GGTCTATTCT 

TTTGATTTAT 

3420 

AAGGGATTTT 

GCCGATTTCG 

GCCTATTGGT 

TAAAAAATGA 

GCTGATTTAA 

CAAAAATTTA 

3480 

ACGCGAATTT 

TAACAAAATA 

TTAACGTTTA 

CAATTTCCCA 

GGTGGCACTT 

TTCGGGGAAA 

3540 

TGTGCGCGGA 

ACCCCTATTT 

GTTTATTTTT 

CTAAATACAT 

TCAAATATGT 

ATCCGCTCAT 

3600 

GAGACAATAA 

CCCTGATAAA 

TGCTTCAATA 

ATATTGAAAA 

AGGAAGAGTA 

TGAGTATTCA 

3660 
ACATTTCCGT 

GTCGCCCTTA 

TTCCCTTTTT 

TGCGGCATTT 

TGCCTTCCTG 

TTTTTGCTCA 

3720 

CCCAGAAACG 

CTGGTGAAAG 

TAAAAGATGC 

TGAAGATCAG 

TTGGGTGCAC 

GAGTGGGTTA 

3780 

CATCGAACTG 

GATCTCAACA 

GCGGTAAGAT 

CCTTGAGAGT 

TTTCGCCCCG 

AAGAACGTTT 

3840 

TCCAATGATG 

AGCACTTTTA 

AAGTTCTGCT 

ATGTGGCGCG 

GTATTATCCC 

GTATTGACGC 

3900 

CGGGCAAGAG 

CAACTCGGTC 

GCCGCATACA 

CTATTCTCAG 

AATGACTTGG 

TTGAGTACTC 

3960 

ACCAGTCACA 

GAAAAGCATC 

TTACGGATGG 

CATGACAGTA 

AGAGAATTAT 

GCAGTGCTGC 

4020 

CATAACCATG 

AGTGATAACA 

CTGCGGCCAA 

CTTACTTCTG 

ACAACGATCG 

GAGGACCGAA 

4080 

GGAGCTAACC 

GCTTTTTTGC 

ACAACATGGG 

GGATCATGTA 

ACTCGCCTTG 

ATCGTTGGGA 

4140 

ACCGGAGCTG 

AATGAAGCCA 

TACCAAACGA 

CGAGCGTGAC 

ACCACGATGC 

CTGTAGCAAT 

4200 

GGCAACAACG 

TTGCGCAAAC 

TATTAACTGG 

CGAACTACTT 

ACTCTAGCTT 

CCCGGCAACA 

4260 

ATTAATAGAC 

TGGATGGAGG 

CGGATAAAGT 

TGCAGGACCA 

CTTCTGCGCT 

CGGCCCTTCC 

4320 

GGCTGGCTGG 

TTTATTGCTG 

ATAAATCTGG 

AGCCGGTGAG 

CGTGGGTCTC 

GCGGTATCAT 

4380 

TGCAGCACTG 

GGGCCAGATG 

GTAAGCCCTC 

CCGTATCGTA 

GTTATCTACA 

CGACGGGGAG 

4440 

TCAGGCAACT 

ATGGATGAAC 

GAAATAGACA 

GATCGCTGAG 

ATAGGTGCCT 

CACTGATTAA 

4500 

GCATTGGTAA 

CTGTCAGACC 

AAGTTTACTC 

ATATATACTT 

TAGATTGATT 

TAAAACTTCA 

4560 

TTTTTAATTT 

AAAAGGATCT 

AGGTGAAGAT 

CCTTTTTGAT 

AATCTCATGA 

CCAAAATCCC 

4620 

TTAACGTGAG 

TTTTCGTTCC 

ACTGAGCGTC 

AGACCCCGTA 

GAAAAGATCA 

AAGGATCTTC 

4680 

TTGAGATCCT 

TTTTTTCTGC 

GCGTAATCTG 

CTGCTTGCAA 

ACAAAAAAAC 

CACCGCTACC 

4740 

AGCGGTGGTT 

TGTTTGCCGG 

ATCAAGAGCT 

ACCAACTCTT 

TTTCCGAAGG 

TAACTGGCTT 

4800 

CAGCAGAGCG 

CAGATACCAA 

ATACTGTCCT 

TCTAGTGTAG 

CCGTAGTTAG 

GCCACCACTT 

4860 

CAAGAACTCT 

GTAGCACCGC 

CTACATACCT 

CGCTCTGCTA 

ATCCTGTTAC 

CAGTGGCTGC 

4920 

TGCCAGTGGC 

GATAAGTCGT 

GTCTTACCGG 

GTTGGACTCA 

AGACGATAGT 

TACCGGATAA 

4980 

GGCGCAGCGG 

TCGGGCTGAA 

CGGGGGGTTC 

GTGCACACAG 

CCCAGCTTGG 

AGCGAACGAC 

5040 

CTACACCGAA 

CTGAGATACC 

TACAGCGTGA 

GCTATGAGAA 

AGCGCCACGC 

TTCCCGAAGG 

5100 

GAGAAAGGCG 

GACAGGTATC 

CGGTAAGCGG 

CAGGGTCGGA 

ACAGGAGAGC 

GCACGAGGGA 

5160 

GCTTCCAGGG 

GGAAACGCCT 

GGTATCTTTA 

TAGTCCTGTC 

GGGTTTCGCC 

ACCTCTGACT 

5220 

TGAGCGTCGA 

TTTTTGTGAT 

GCTCGTCAGG 

GGGGCGGAGC 

CTATGGAAAA 

ACGCCAGCAA 

5280 

CGCGGCCTTT 

TTACGGTTCC 

TGGCCTTTTG 

CTGGCCTTTT 

GCTCACATGT 

TCTTTCCTGC 

5340 

GTTATCCCCT 

GATTCTGTGG 

ATAACCGTAT 

TACCGCCTTT 

GAGTGAGCTG 

ATACCGCTCG 

5400 

CCGCAGCCGA 

ACGACCGAGC 

GCAGCGAGTC 

AGTGAGCGAG 

GAAGCGGAAG 

AGCGCCCAAT 

5460 

ACGCAAACCG 

CCTCTCCCCG 

CGCGTTGGCC 

GATTCATTAA 

TGCAGCTGGC 

ACGACAGGTT 

5520 

TCCCGACTGG 

AAAGCGGGCA 

GTGAGCGCAA 

CGCAATTAAT 

GTGAGTTAGC 

TCACTCATTA 

5580 

GGCACCCCAG 

GCTTTACACT 

TTATGCTTCC 

GGCTCGTATG 

TTGTGTGGAA 

TTGTGAGCGG 

5640 

ATAACAATTT 

CACACAGGAA 

ACAGCTATGA 

CCATGATTAC 

GCCAAGCTTT 

TGCGATCAAT 

5700 

AAATGGATCA 

CAACCAGTAT 

CTCTTAACGA 

TGTTCTTCGC 

AGATGATGAT 

TCATTTTTTA 

5760 
AGTATTTGGC 

TAGTCAAGAT 

GATGAATCTT 

CATTATCTGA 

TATATTGCAA 

ATCACTCAAT 

5820 

ATCTAGACTT 

TCTGTTATTA 

TTATTGATCC 

AATCAAAAAA 

TAAATTAGAA 

GCCGTGGGTC 

5880 

ATTGTTATGA 

ATCTCTTTCA 

GAGGAATACA 

GACAATTGAC 

AAAATTCACA 

GACTTTCAAG 

5940 

ATTTTAAAAA 

ACTGTTTAAC 

AAGGTCCCTA 

TTGTTACAGA 

TGGAAGGGTC 

AAACTTAATA 

6000 

AAGGATATTT 

GTTCGACTTT 

GTGATTAGTT 

TGATGCGATT 

CAAAAAAGAA 

TCCTCTCTAG 

6060 

CTACCACCGC 

AATAGATCCT 

GTTAGATACA 

TAGATCCTCG 

TCGCAATATC 

GCATTTTCTA 

6120 

ACGTGATGGA 

TATATTAAAG 

TCGAATAAAG 

TGAACAATAA 

TTAATTCTTT 

ATTGTCATCA 

6180 

TGAACGGCGG 

ACATATTCAG 

TTGATAATCG 

GCCCCATGTT 

TTCAGGTAAA 

AGTACAGAAT 

6240 

TAATTAGACG 

AGTTAGACGT 

TATCAAATAG 

CTCAATATAA 

ATGCGTGACT 

ATAAAATATT 

6300 

CTAACGATAA 

TAGATACGGA 

ACGGGACTAT 

GGACGCATGA 

TAAGAATAAT 

TTTGAAGCAT 

6360 

TGGAAGCAAC 

TAAACTATGT 

GATGTCTTGG 

AATCAATTAC 

AGATTTCTCC 

GTGATAGGTA 

6420 

TCGATGAAGG 

ACAGTTCTTT 

CCAGACATTG 

TTGAATTGAT 

CTCGATCCCG 

CGAAATTAAT 

6480 

ACGACTCACT 

ATAGGGAGAC 

CACAACGGTT 

TCCCTCTAGC 

GGGATCAATT 

CCGCCCCTCT 

6540 

CCCTCCCCCC 

CCCCTAACGT 

TACTGGCCGA 

AGCCGCTTGG 

AATAAGGCCG 

GTGTGCGTTT 

6600 

GTCTATATGT 

TATTTTCCAC 

CATATTGCCG 

TCTTTTGGCA 

ATGTGAGGGC 

CCGGAAACCT 

6660 

GGCCCTGTCT 

TCTTGACGAG 

CATTCCTAGG 

GGTCTTTCCC 

CTCTCGCCAA 

AGGAATGCAA 

6720 

GGTCTGTTGA 

ATGTCGTGAA 

GGAAGCAGTT 

CCTCTGGAAG 

CTTCTTGAAG 

ACAAACAACG 

6780 

TCTGTAGCGA 

CCCTTTGCAG 

GCAGCGGAAC 

CCCCCACCTG 

GCGACAGGTG 

CCTCTGCGGC 

6840 

CAAAAGCCAC 

GTGTATAAGA 

TACACCTGCA 

AAGGCGGCAC 

AACCCCAGTG 

CCACGTTGTG 

6900 

AGTTGGATAG 

TTGTGGAAAG 

AGTCAAATGG 

CTCTCCTCAA 

GCGTATTCAA 

CAAGGGGCTG 

6960 

AAGGATGCCC 

AGAAGGTACC 

CCATTGTATG 

GGATCTGATC 

TGGGGCCTCG 

GTGCACATGC 

7020 

TTTACATGTG 

TTTAGTCGAG 

GTTAAAAAAC 

GTCTAGGCCC 

CCCGAACCAC 

GGGGACGTGG 

7080 

TTTTCCTTTG 

AAAAACACGA 

TAATACC 

(2) INFORMATION FOR SEQ ID NO: 22: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: cDNA 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 




ATGGACACTG 

AATCTAACAA 

TGGCACTGTA 

TCTGACATAC 

TCTATCCTGA 

GTGTCACCTT 

60 

AACTCTCCTA 

TCGTTAAAGG 

TAAAATAGCA 

CAATTACACA 

CTATTATGAG 

TCTACCTCAG 

120 

CCTTATGATA 

TGGATGACGA 

CTCAATACTA 

GTTATCACTA 

GACAGAAAAT 

AAAACTTAAT 

180 

AAATTGGATA 

AAAGACAACG 

ATCTATTAGA 

AGATTAAAAT 

TAATATTAAC 

TGAAAAAGTG 

240 

AATGACTTAG 

GAAAATACAC 

ATTTATCAGA 

TATCCAGAAA 

TGTCAAAAGA 

AATGTTCAAA 

300 
TTATATATAC 

CTGGTATTAA 

CAGTAAAGTG 

ACTGAATTAT 

TACTTAAAGC 

AGATAGAACA 

360 

TATAGTCAAA 

TGACTGATGG 

ATTAAGAGAT 

CTATGGATTA 

ATGTGCTATC 

AAAATTAGCC 

420 

TCAAAAAATG 

ATGGAAGCAA 

TTATGATCTT 

AATGAAGAAA 

TTAATAATAT 

ATCGAAAGTT 

480 

CACACAACCT 

ATAAATCAGA 

TAAATGGTAT 

AATCCATTCA 

AAACATGGTT 

TACTATCAAG 

540 

TATGATATGA 

GAAGATTACA 

AAAAGCTCGA 

AATGAGATCA 

CTTTTAATGT 

TGGGAAGGAT 

600 

TATAACTTGT 

TAGAAGACCA 

GAAGAATTTC 

TTATTGATAC 

ATCCAGAATT 

GGTTTTGATA 

660 

TTAGATAAAC 

AAAACTATAA 

TGGTTATCTA 

ATTACTCCTG 

AATTAGTATT 

GATGTATTGT 

720 

GACGTAGTCG 

AAGGCCGATG 

GAATATAAGT 

GCATGTGCTA 

AGTTAGATCC 

AAAATTACAA 

780 

TCTATGTATC 

AGAAAGGTAA 

TAACCTGTGG 

GAAGTGATAG 

ATAAATTGTT 

TCCAATTATG 

840 

GGAGAAAAGA 

CATTTGATGT 

GATATCGTTA 

TTAGAACCAC 

TTGCATTATC 

CTTAATTCAA 

900 

ACTCATGATC 

CTGTTAAACA 

ACTAAGAGGA 

GCTTTTTTAA 

ATCATGTGTT 

ATCCGAGATG 

960 

GAATTAATAT 

TTGAATCTAG 

AGAATCGATT 

AAGGAATTTC 

TGAGTGTAGA 

TTACATTGAT 

1020 

AAAATTTTAG 

ATATATTTAA 

TAAGTCTACA 

ATAGATGAAA 

TAGCAGAGAT 

TTTCTCTTTT 

1080 

TTTAGAACAT 

TTGGGCATCC 

TCCATTAGAA 

GCTAGTATTG 

CAGCAGAAAA 

GGTTAGAAAA 

1140 

TATATGTATA 

TTGGAAAACA 

ATTAAAATTT 

GACACTATTA 

ATAAATGTCA 

TGCTATCTTC 

1200 

TGTACAATAA 

TAATTAACGG 

ATATAGAGAG 

AGGCATGGTG 

GACAGTGGCC 

TCCTGTGACA 

1260 

TTACCTGATC 

ATGCACACGA 

ATTCATCATA 

AATGCTTACG 

GTTCAAACTC 

TGCGATATCA 

1320 

TATGAAAATG 

CTGTTGATTA 

TTACCAGAGC 

TTTATAGGAA 

TAAAATTCAA 

TAAATTCATA 

1380 

GAGCCTCAGT 

TAGATGAGGA 

TTTGACAATT 

TATATGAAAG 

ATAAAGCATT 

ATCTCCAAAA 

1440 

AAATCAAATT 

GGGACACAGT 

TTATCCTGCA 

TCTAATTTAC 

TGTACCGTAC 

TAACGCATCC 

1500 

AACGAATCAC 

GAAGATTAGT 

TGAAGTATTT 

ATAGCAGATA 

GTAAATTTGA 

TCCTCATCAG 

1560 

ATATTGGATT 

ATGTAGAATC 

TGGGGACTGG 

TTAGATGATC 

CAGAATTTAA 

TATTTCTTAT 

1620 

AGTCTTAAAG 

AAAAAGAGAT 

CAAACAGGAA 

GGTAGACTCT 

TTGCAAAAAT 

GACATACAAA 

1680 

ATGAGAGCTA 

CACAAGTTTT 

ATCAGAGACA 

CTACTTGCAA 

ATAACATAGG 

AAAATTCTTT 

1740 

CAAGAAAATG 

GGATGGTGAA 

GGGAGAGATT 

GAATTACTTA 

AGAGATTAAC 

AACCATATCA 

1800 

ATATCAGGAG 

TTCCACGGTA 

TAATGAAGTG 

TACAATAATT 

CTAAAAGCCA 

TACAGATGAC 

1860 

CTTAAAACCT 

ACAATAAAAT 

AAGTAATCTT 

AATTTGTCTT 

CTAATCAGAA 

ATCAAAGAAA 

1920 

TTTGAATTCA 

AGTCAACGGA 

TATCTACAAT 

GATGGATACG 

AGACTGTGAG 

CTGTTTCCTA 

1980 

ACAACAGATC 

TCAAAAAATA 

CTGTCTTAAT 

TGGAGATATG 

AATCAACAGC 

TCTATTTGGA 

2040 

GAAACTTGCA 

ACCAAATATT 

TGGATTAAAT 

AAATTGTTTA 

ATTGGTTACA 

CCCTCGTCTT 

2100 

GAAGGAAGTA 

CAATCTATGT 

AGGTGATCCT 

TACTGTCCTC 

CATCAGATAA 

AGAACATATA 

2160 

TCATTAGAGG 

ATCACCCTGA 

TTCTGGTTTT 

TACGTTCATA 

ACCCAAGAGG 

GGGTATAGAA 

2220 

GGATTTTGTC 

AAAAATTATG 

GACACTCATA 

TCTATAAGTG 

CAATACATCT 

AGCAGCTGTT 

2280 

AGAATAGGCG 

TGAGGGTGAC 

TGCAATGGTT 

CAAGGAGACA 

ATCAAGCTAT 

AGCTGTAACC 

2340 

ACAAGAGTAC 

CCAACAATTA 

TGACTACAGA 

GTTAAGAAGG 

AGATAGTTTA 

TAAAGATGTA 

2400 
GTGAGATTTT 

TTGATTCATT 

AAGAGAAGTG 

ATGGATGATC 

TAGGTCATGA 

ACTTAAATTA 

2460 

AATGAAACGA 

TTATAAGTAG 

CAAGATGTTC 

ATATATAGCA 

AAAGAATCTA 

TTATGATGGG 

2520 

AGAATTCTTC 

CTCAAGCTCT 

AAAAGCATTA 

TCTAGATGTG 

TCTTCTGGTC 

AGAGACAGTA 

2580 

ATAGACGAAA 

CAAGATCAGC 

ATCTTCAAAT 

TTGGCAACAT 

CATTTGCAAA 

AGCAATTGAG 

2640 

AATGGTTATT 

CACCTGTTCT 

AGGATATGCA 

TGCTCAATTT 

TTAAGAACAT 

TCAACAACTA 

2700 

TATATTGCCC 

TTGGGATGAA 

TATCAATCCA 

ACTATAACAC 

AGAATATCAG 

AGATCAGTAT 

2760 

TTTAGGAATC 

CAAATTGGAT 

GCAATATGCC 

TCTTTAATAC 

CTGCTAGTGT 

TGGGGGATTC 

2820 

AATTACATGG 

CCATGTCAAG 

ATGTTTTGTA 

AGGAATATTG 

GTGATCCATC 

AGTTGCCGCA 

2880 

TTGGCTGATA 

TTAAAAGATT 

TATTAAGGCG 

AATCTATTAG 

ACCGAAGTGT 

TCTTTATAGG 

2940 

ATTATGAATC 

AAGAACCAGG 

TGAGTCATCT 

TTTTTGGACT 

GGGCTTCAGA 

TCCATATTCA 

3000 

TGCAATTTAC 

CACAATCTCA 

AAATATAACC 

ACCATGATAA 

AAAATATAAC 

AGCAAGGAAT 

3060 

GTATTACAAG 

ATTCACCAAA 

TCCATTATTA 

TCTGGATTAT 

TCACAAATAC 

AATGATAGAA 

3120 

GAAGATGAAG 

AATTAGCTGA 

GTTCCTGATG 

GACAGGAAGG 

TAATTCTCCC 

TAGAGTTGCA 

3180 

CATGATATTC 

TAGATAATTC 

TCTCACAGGA 

ATTAGAAATG 

CCATAGCTGG 

AATGTTAGAT 

3240 

ACGACAAAAT 

CACTAATTCG 

GGTTGGCATA 

AATAGAGGAG 

GACTGACATA 

TAGTTTGTTG 

3300 

AGGAAAATCA 

GTAATTACGA 

TCTAGTACAA 

TATGAAACAC 

TAAGTAGGAC 

TTTGCGACTA 

3360 

ATTGTAAGTG 

ATAAAATCAA 

GTATGAAGAT 

ATGTGTTCGG 

TAGACCTTGC 

CATAGCATTG 

3420 

CGACAAAAGA 

TGTGGATTCA 

TTTATCAGGA 

GGAAGGATGA 

TAAGTGGACT 

TGAAACGCCT 

3480 

GACCCATTAG 

AATTACTATC 

TGGGGTAGTA 

ATAACAGGAT 

CAGAACATTG 

TAAAATATGT 

3540 

TATTCTTCAG 

ATGGCACAAA 

CCCATATACT 

TGGATGTATT 

TACCCGGTAA 

TATCAAAATA 

3600 

GGATCAGCAG 

AAACAGGTAT 

ATCGTCATTA 

AGAGTTCCTT 

ATTTTGGATC 

AGTCACTGAT 

3660 

GAAAGATCTG 

AAGCACAATT 

AGGATATATC 

AAGAATCTTA 

GTAAACCTGC 

AAAAGCCGCA 

3720 

ATAAGAATAG 

CAATGATATA 

TACATGGGCA 

TTTGGTAATG 

ATGAGATATC 

TTGGATGGAA 

3780 

GCCTCACAGA 

TAGCACAAAC 

ACGTGCAAAT 

TTTACACTAG 

ATAGTCTCAA 

AATTTTAACA 

3840 

CCGGTAGCTA 

CAT C AAC AAA 

TTTATCACAC 

AGATTAAAGG 

ATACTGCAAC 

TCAGATGAAA 

3900 

TTCTCCAGTA 

CATCATTGAT 

CAGAGTCAGC 

AGATTCATAA 

CAATGTCCAA 

TGATAACATG 

3960 

TCTATCAAAG 

AAGCTAATGA 

AACCAAAGAT 

ACTAATCTTA 

TTTATCAACA 

AATAATGTTA 

4020 

ACAGGATTAA 

GTGTTTTCGA 

ATATTTATTT 

AGATTAAAAG 

AAACCACAGG 

ACACAACCCT 

4080 

ATAGTTATGC 

ATCTGCACAT 

AGAAGATGAG 

TGTTGTATTA 

AAGAAAGTTT 

TAATGATGAA 

4140 

CATATTAATC 

CAGAGTCTAC 

ATTAGAATTA 

ATTCGATATC 

CTGAAAGTAA 

TGAATTTATT 

4200 

TATGATAAAG 

ACCCACTCAA 

AGATGTGGAC 

TTATCAAAAC 

TTATGGTTAT 

T AAAG AC CAT 

4260 

TCTTACACAA 

TTGATATGAA 

TTATTGGGAT 

GAT AC TG AC A 

TCATACATGC 

AATTTCAATA 

4320 

TGTACTGCAA 

TTACAATAGC 

AGATACTATG 

TCACAATTAG 

ATCGAGATAA 

TTTAAAAGAG 

4380 

ATAATAGTTA 

TTGCAAATGA 

TGATGATATT 

AATAGCTTAA 

TCACTGAATT 

TTTGACTCTT 

4440 

GACATACTTG 

TATTTCTCAA 

GACATTTGGT 

GGATTATTAG 

TAAATCAATT 

TGCATACACT 

4500 
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CTTTATAGTC 

TAAAAATAGA 

AGGTAGGGAT 

CTCATTTGGG 

ATTATATAAT 

GAG AAC AC TG 

4560 

AGAGATACTT 

CCCATTCAAT 

ATTAAAAGTA 

TTATCTAATG 

CATTATCTCA 

TCCTAAAGTA 

4620 

TTCAAGAGGT 

TCTGGGATTG 

TGGAGTTTTA 

AACCCTATTT 

ATGGTCCTAA 

TACTGCTAGT 

4680 

CAAGACCAGA 

TAAAACTTGC 

CCTATCTATA 

TGTGAATATT 

CACTAGATCT 

ATTTATGAGA 

4740 

GAATGGTTGA 

ATGGTGTATC 

ACTTGAAATA 

TACATTTGTG 

ACAGCGATAT 

GGAAGTTGCA 

4800 

AATGATAGGA 

AACAAGCCTT 

TATTTCTAGA 

CACCTTTCAT 

TTGTTTGTTG 

TTTAGCAGAA 

4860 

ATTGCATCTT 

TCGGACCTAA 

CCTGTTAAAC 

TTAACATACT 

TGGAGAGACT 

TGATCTATTG 

4920 

AAACAATATC 

TTGAATTAAA 

TATTAAAGAA 

GACCCTACTC 

TTAAATATGT 

ACAAATATCT 

4980 

GGATTATTAA 

TTAAATCGTT 

CCCATCAACT 

GTAACATACG 

TAAGAAAGAC 

TGCAATCAAA 

5040 

TATCTAAGGA 

TTCGCGGTAT 

TAGTCCACCT 

GAGGTAATTG 

ATGATTGGGA 

TCCGGTAGAA 

5100 

GATGAAAATA 

TGCTGGATAA 

CATTGTCAAA 

ACTATAAATG 

ATAACTGTAA 

TAAAGATAAT 

5160 

AAAGGGAATA 

AAATTAACAA 

TTTCTGGGGA 

CTAGCACTTA 

AGAACTATCA 

AGTCCTTAAA 

5220 

ATCAGATCTA 

TAACAAGTGA 

TTCTGATGAT 

AATGATAGAC 

TAGATGCTAA 

TACAAGTGGT 

5280 

TTGACACTTC 

CTCAAGGAGG 

GAATTATCTA 

TCGCATCAAT 

TGAGATTATT 

CGGAATCAAC 

5340 

AGCACTAGTT 

GTCTGAAAGC 

TCTTGAGTTA 

TCACAAATTT 

TAATGAAGGA 

AGTCAATAAA 

5400 

GACAAGGACA 

GGCTCTTCCT 

GGGAGAAGGA 

GCAGGAGCTA 

TGCTAGCAT G 

TTATGATGCC 

5460 

ACATTAGGAC 

CTGCAGTTAA 

TTATTATAAT 

TCAGGTTTGA 

ATATAACAGA 

TGTAATTGGT 

5520 

CAACGAGAAT 

TGAAAATATT 

TCCTTCAGAG 

GTATCATTAG 

TAGGTAAAAA 

ATTAGGAAAT 

5580 

GTGACACAGA 

TTCTTAACAG 

GGTAAAAGTA 

CTGTTCAATG 

GGAATCCTAA 

TTCAACATGG 

5640 

ATAGGAAATA 

TGGAATGTGA 

GAGCTTAATA 

TGGAGTGAAT 

TAAATGATAA 

GTCCATTGGA 

5700 

TTAGTACATT 

GTGATATGGA 

AGGAGCTATC 

GGTAAATCAG 

AAGAAACTGT 

TCTACATGAA 

5760 

CATTATAGTG 

TTATAAGAAT 

TACATACTTG 

ATTGGGGATG 

ATGATGTTGT 

TTTAGTTTCC 

5820 

AAAATTATAC 

CTACAATCAC 

TCCGAATTGG 

TCTAGAATAC 

TTTATCTATA 

TAAATTATAT 

5880 

TGGAAAGATG 

TAAGTATAAT 

ATCACTCAAA 

ACTTCTAATC 

CTGCATCAAC 

AGAATTATAT 

5940 

CTAATTTCGA 

AAGATGCATA 

TTGTACTATA 

ATGGAACCTA 

GTGAAATTGT 

TTTATCAAAA 

6000 

CTTAAAAGAT 

TGTCACTCTT 

GGAAGAAAAT 

AATCTATTAA 

AATGGATCAT 

TTTATCAAAG 

6060 

AAGAGGAATA 

ATGAATGGTT 

ACATCATGAA 

ATCAAAGAAG 

GAGAAAGAGA 

TTATGGAATC 

6120 

ATGAGACCAT 

ATCATATGGC 

ACTACAAATC 

TTTGGATTTC 

AAATCAATTT 

AAATCATCTG 

6180 

GCGAAAGAAT 

TTTTATCAAC 

CCCAGATCTG 

ACTAATATCA 

ACAATATAAT 

CCAAAGTTTT 

6240 

CAGCGAACAA 

TAAAGGATGT 

TTTATTTGAA 

TGGATTAATA 

TAACTCATGA 

TGATAAGAGA 

6300 

CATAAATTAG 

GCGGAAGATA 

TAACATATTC 

CCACTGAAAA 

ATAAGGGAAA 

GTTAAGACTG 

6360 

CTATCGAGAA 

GACTAGTATT 

AAGTTGGATT 

TCATTATCAT 

TATCGACTCG 

ATTACTTACA 

6420 

GGTCGCTTTC 

CTGATGAAAA 

ATTTGAACAT 

AGAGCACAGA 

CTGGATATGT 

ATCATTAGCT 

6480 

GATACTGATT 

TAGAATCATT 

AAAGTTATTG 

TCGAAAAACA 

T C ATTAAGAA 

TTACAGAGAG 

6540 

TGTATAGGAT 

CAATATCATA 

TTGGTTTCTA 

ACCAAAGAAG 

TTAAAATACT 

TATGAAATTG 

6600 
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ATTGGTGGTG CTAAATTATT AGGAATTCCC AGACAATATA AAGAACCCGA AGACCAGTTA 666 0 
TTAGAAAACT ACAATCAACA TGATGAATTT GATATCGATT AAAACATAAA TACAATGTCG 6 72 0 
ACGATCCGGC TGCTAACAAA GCCCGAAAGG AAGCTGAGTT GGCTGCTGCC ACCGCTGAGC 6 78 0 
AATAACTAGC ATAACCCCTT GGGGCCTCTA AACGGGTCTT GAGGGGTTTT TTGCTGAAAG 6 84 0 
GAGGAACTAT ATCCGGATCG AGATCAATTC TGTGAGCGTA TGGCAAACGA AGGAAAAATA 6900 
GTTATAGTAG CCGCACTCGA TGGGACATTT CAACGTAAAC CGTTTAATAA TATTTTGAAT 6 96 0 
CTTATTCCAT TATCTGAAAT GGTGGTAAAA CTAACTGCTG TGTGTATGAA ATGCTTTAAG 7020 
GAGGCTTCCT TTTCTAAACG ATTGGGTGAG GAAACCGAGA TAGAAATAAT AGGAGGTAAT 7 08 0 
GATATGTATC AATCGGTGTG TAGAAAGTGT TACATCGACT CATAATATTA TATTTTTTAT 7140 
CTAAAAAACT AAAAATAAAC ATTGATTAAA TTTTAATATA ATACTTAAAA ATGGATGTTG 72 00 
TGTCGTTAGA TAAACCGTTT ATGTATTTTG AGGAAATTGA TAATGAGTTA GATTACGAAC 726 0 
CAGAAAGTGC AAATGAGGTC GCAAAAAAAC TGCCGTATCA AGGACAGTTA AAACTATTAC 732 0 
TAGGAGAATT ATTTTTTCTT AGTAAGTTAC AGCGACACGG TATATTAGAT GGTGCCACCG 73 8 0 
TAGTGTATAT AGGATCTGCT CCCGGTACAC ATATACGTTA TTTGAGAGAT CATTTCTATA 744 0 
ATTTAGGAGT GATCATCAAA TGGATGCTAA TTGACGGCCG CCATCATGAT CCTATTTTAA 7 5 00 
ATGGATTGCG TGATGTGACT CTAGTGACTC GGTTCGTTGA TGAGGAATAT CTACGATCCA 756 0 
TCAAAAAACA ACTGCATCCT T CTAAG ATT A TTTTAATTTC TGATGTGAGA TCCAAACGAG 762 0 
GAGGAAATGA ACCTAGTACG GCGGATTTAC TAAGTAATTA CGCTCTACAA AATGTCATGA 76 8 0 
TTAGTATTTT AAACCCCGTG GCGTCTAGTC TTAAATGGAG ATGCCCGTTT CCAGATCAAT 774 0 
GGATCAAGGA CTTTTATATC C C AC ACGGT A ATAAAATGTT ACAACCTTTT GCTCCTTCAT 78 00 
ATTCAGGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA 7860 
TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AGCGAAGAGG CCCGCACCGA 792 0 
TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG CGCGACGCGC CCTGTAGCGG 7980 
CGCATTAAGC GCGGCGGGTG TGGTGGTTAC GCGCAGCGTG ACCGCTACAC TTGCCAGCGC 8040 
CCTAGCGCCC GCTCCTTTCG CTTTCTTCCC TTCCTTTCTC GCCACGTTCG CCGGCTTTCC 8100 
CCGTCAAGCT CTAAATCGGG GGCTCCCTTT AGGGTTC CGA TTTAGTGCTT TACGGCACCT 8160 
CGACCCCAAA AAACTTGATT AGGGTGATGG TTCACGTAGT GGGCCATCGC CCTGATAGAC 822 0 
GGTTTTTCGC CCTTTGACGT TGGAGTCCAC GTTCTTTAAT AGTGGACTCT TGTTCCAAAC 8280 
TGGAACAACA CTCAACCCTA TCTCGGTCTA TTCTTTTGAT TTATAAGGGA TTTTGCCGAT 834 0 
TTCGGCCTAT TGGTTAAAAA ATGAGCTGAT TTAACAAAAA TTTAACGCGA ATTTTAACAA 8400 
AATATTAACG TTTACAATTT CCCAGGTGGC ACTTTTCGGG GAAATGTGCG CGGAACCCCT 8460 
ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC TCATGAGACA ATAACCCTGA 8 52 0 
TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC 8 580 
CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG 8640 
AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC 8700 
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AACAGCGGTA 

AGATCCTTGA 

GAGTTTTCGC 

CCCGAAGAAC 

GTTTTCCAAT 

GATGAGCACT 

8760 

TTTAAAGTTC 

TGCTATGTGG 

CGCGGTATTA 

TCCCGTATTG 

ACGCCGGGCA 

AGAGCAACTC 

8820 

GGTCGCCGCA 

TACACTATTC 

TCAGAATGAC 

TTGGTTGAGT 

ACTCACCAGT 

CACAGAAAAG 

8880 

CATCTTACGG 

ATGGCATGAC 

AGTAAGAGAA 

TTATGCAGTG 

CTGCCATAAC 

CATGAGTGAT 

8940 

AACACTGCGG 

CCAACTTACT 

TCTGACAACG 

ATCGGAGGAC 

CGAAGGAGCT 

AACCGCTTTT 

9000 

TTGCACAACA 

TGGGGGATCA 

TGTAACTCGC 

CTTGATCGTT 

GGGAACCGGA 

GCTGAATGAA 

9060 

GCCATACCAA 

ACGACGAGCG 

TGACACCACG 

atgcctgtag 

CAATGGCAAC 

AACGTTGCGC 

9120 

AAACTATTAA 

CTGGCGAACT 

ACTTACTCTA 

GCTTCCCGGC 

AACAATTAAT 

AGACTGGATG 

9180 

GAGGCGGATA 

AAGTTGCAGG 

ACCACTTCTG 

CGCTCGGCCC 

TTCCGGCTGG 

CTGGTTTATT 

9240 

GCTGATAAAT 

CTGGAGCCGG 

TGAGCGTGGG 

TCTCGCGGTA 

TCATTGCAGC 

ACTGGGGCCA 

9300 

GATGGTAAGC 

CCTCCCGTAT 

CGTAGTTATC 

TACACGACGG 

GGAGTCAGGC 

AACTATGGAT 

9360 

GAACGAAATA 

GACAGATCGC 

TGAGATAGGT 

GCCTCACTGA 

TTAAGCATTG 

GTAACTGTCA 

9420 

GACCAAGTTT 

ACTCATATAT 

ACTTTAGATT 

GATTTAAAAC 

TTCATTTTTA 

ATTTAAAAGG 

9480 

ATCTAGGTGA 

AGATCCTTTT 

TGATAATCTC 

ATGACCAAAA 

tcccttaacg 

TGAGTTTTCG 

9540 

TTCCACTGAG 

CGTCAGACCC 

CGTAGAAAAG 

ATCAAAGGAT 

CTTCTTGAGA 

TCCTTTTTTT 

9600 

CTGCGCGTAA 

TCTGCTGCTT 

GCAAACAAAA 

AAACCACCGC 

TACCAGCGGT 

GGTTTGTTTG 

9660 

CCGGATCAAG 

AGCTACCAAC 

TCTTTTTCCG 

AAGGTAACTG 

GCTTCAGCAG 

AGCGCAGATA 

9720 

CCAAATACTG 

TCCTTCTAGT 

GTAGCCGTAG 

TTAGGCCACC 

ACTTCAAGAA 

CTCTGTAGCA 

9780 

CCGCCTACAT 

ACCTCGCTCT 

GCTAATCCTG 

TTACCAGTGG 

CTGCTGCCAG 

TGGCGATAAG 

9840 

TCGTGTCTTA 

CCGGGTTGGA 

CTCAAGACGA 

TAGTTACCGG 

ATAAGGCGCA 

GCGGTCGGGC 

9900 

TGAACGGGGG 

GTTCGTGCAC 

ACAGCCCAGC 

TTGGAGCGAA 

CGACCTACAC 

CGAACTGAGA 

9960 

TACCTACAGC 

GTGAGCTATG 

AGAAAGCGCC 

ACGCTTCCCG 

AAGGGAGAAA 

GGCGGACAGG 

10020 

TATCCGGTAA 

GCGGCAGGGT 

CGGAACAGGA 

GAGCGCACGA 

GGGAGCTTCC 

AGGGGGAAAC 

10080 

GCCTGGTATC 

TTTATAGTCC 

TGTCGGGTTT 

CGCCACCTCT 

GACTTGAGCG 

TCGATTTTTG 

10140 

TGATGCTCGT 

CAGGGGGGCG 

GAGCCTATGG 

AAAAACGCCA 

GCAACGCGGC 

ctttttacgg 

10200 

TTCCTGGCCT 

TTTGCTGGCC 

TTTTGCTCAC 

ATGTTCTTTC 

CTGCGTTATC 

CCCTGATTCT 

10260 

GTGGAT AAC C 

GTATTACCGC 

CTTTGAGTGA 

GCTGATACCG 

CTCGCCGCAG 

CCGAACGACC 

10320 

GAGCGCAGCG 

AGTCAGTGAG 

CGAGGAAGCG 

GAAGAGCGCC 

CAATACGCAA 

AC CGCCTCTC 

10380 

CCCGCGCGTT 

GGCCGATTCA 

TTAATGCAGC 

TGGCACGACA 

GGTTTCCCGA 

CTGGAAAGCG 

10440 

GGCAGTGAGC 

GCAACGCAAT 

TAATGTGAGT 

TAGCTCACTC 

ATTAGGCACC 

CCAGGCTTTA 

10500 

CACTTTATGC 

TTCCGGCTCG 

TATGTTGTGT 

GGAATTGTGA 

GCGGATAACA 

ATTTCACACA 

10560 

GGAAACAGCT 

ATGACCATGA 

TTACGCCAAG 

CTTTTGCGAT 

CAATAAATGG 

ATCACAACCA 

10620 

GTATCTCTTA 

ACGATGTTCT 

TCGCAGATGA 

TGATTCATTT 

TTTAAGTATT 

TGGCTAGTCA 

10680 

AGATGATGAA 

TCTTCATTAT 

CTGATATATT 

GCAAATCACT 

CAATATCTAG 

ACTTTCTGTT 

10740 

ATT ATT AT TG 

ATCCAATCAA 

AAAATAAATT 

AGAAGC CGTG 

GGTCATTGTT 

ATGAATCTCT 

10800 



TTCAGAGGAA 

TACAGACAAT 

TGACAAAATT 

CACAGACTTT 

CAAGATTTTA 

AAAAAC TGTT 

10860 

TAACAAGGTC 

C CTATTGTT A 

CAGATGGAAG 

GGTCAAACTT 

AATAAAGGAT 

ATTTGTTCGA 

10920 

CTTTGTGATT 

AGTTTGATGC 

GATTCAAAAA 

AGAATCCTCT 

CTAGCTACCA 

CCGCAATAGA 

10980 

TCCTGTTAGA 

TACATAGATC 

CTCGTCGCAA 

TATCGCATTT 

TCTAACGTGA 

TGGATATATT 

11040 

AAAGTCGAAT 

AAAGTGAACA 

ATAATTAATT 

CTTTATTGTC 

ATCATGAACG 

GCGGACATAT 

11100 

T C AGTTG AT A 

ATCGGCCCCA 

TGTTTTCAGG 

TAAAAGTACA 

GAATTAATTA 

GACGAGTTAG 

11160 

ACGTTATCAA 

ATAGCTCAAT 

ATAAATGCGT 

GACTATAAAA 

TATTCTAACG 

ATAATAGATA 

11220 

CGGAACGGGA 

CTATGGACGC 

ATGATAAGAA 

TAATTTTGAA 

GCATTGGAAG 

CAACTAAACT 

11280 

ATGTGATGTC 

TTGGAATCAA 

TTACAGATTT 

CTCCGTGATA 

GGTATCGATG 

AAGGACAGTT 

11340 

CTTTCCAGAC 

AT TGTTGAAT 

TGATCTCGAT 

CCCGCGAAAT 

TAATACGACT 

CACTATAGGG 

11400 

AGACCACAAC 

GGTTTCCCTC 

TAGCGGGATC 

AATTCCGCCC 

CTCTCCCTCC 

CCCCCCCCTA 

11460 

ACGTTACTGG 

CCGAAGCCGC 

TTGGAATAAG 

GCCGGTGTGC 

GTTTGTCTAT 

ATGTTATTTT 

11520 

C C AC CAT ATT 

GCCGTCTTTT 

GGCAATGTGA 

GGGCCCGGAA 

ACCTGGCCCT 

GTCTTCTTGA 

11580 

CGAGCATTCC 

TAGGGGTCTT 

TCCCCTCTCG 

CCAAAGGAAT 

GCAAGGTCTG 

TTGAATGTCG 

11640 

TGAAGGAAGC 

AGTTCCTCTG 

GAAGCTTCTT 

GAAGACAAAC 

AACGTCTGTA 

GCGACCCTTT 

11700 

GCAGGCAGCG 

GAACCCCCCA 

CCTGGCGACA 

GGTGCCTCTG 

CGGCCAAAAG 

CCACGTGTAT 

11760 

AAGATACACC 

TGCAAAGGCG 

GCACAACCCC 

AGTGCCACGT 

TGTGAGTTGG 

ATAGTTGTGG 

11820 

AAAGAGTCAA 

ATGGCTCTCC 

TCAAGCGTAT 

TCAACAAGGG 

GCTGAAGGAT 

GCCCAGAAGG 

11880 

TACCCCATTG 

TATGGGATCT 

GATCTGGGGC 

CTCGGTGCAC 

ATGCTTTACA 

TGTGTTTAGT 

11940 

C G AGGTTAAA 

AAACGTCTAG 

GCCCCCCGAA 

CCACGGGGAC 

GTGGTTTTCC 

TTTGAAAAAC 

12000 

ACGATAATAC 

C 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

GATCGATGCT AGCCC 15 

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

GATCGGGCTA GCATC 15

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

TTACATGGCC AT 12

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

TCACATGGCG AT 12

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

TTTGGACTGG GC 12

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

TTTTGATTGG GC 12


(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

TGGTCCTAAT ACTG 14

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

TGGGCCTAAT ATCG 14

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

GCATTATCTA GATGTGTCTT CTGGTCAGAG 30

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

CCTGAATTAT AATAATTAAC TGCAGGTCCT 30

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GGGAAAGAAT CCAGAGACAA GAACGG 26 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GGTGAAGTTG TGGATCCATT TGATTG 26 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CAACCTGTAA GGTACCAGCA TCCG 24 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GATATGGTGT TAGGCCTTGA TCTGTTC 27 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

CGCCATGGAA AAATCAGAGA TCCTCTTCT 29


(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

CTGGATCCTA ATTGGAGTTG TTACCCATGT A 31

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

AACCATGGCT GAAAAAGGGA AAA 23

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

GGTGAAGCTT AAGATGTGAT TTTACATATT TTA 33

(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

AAATAGGATC CCTACAGATC ATTAGATATT AAAAT 35

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CGCCATGGTG TTCAGTGCTT GTTG 24 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CCACAAGCTT AATTAACCAT AATATGCATC A 31 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

TTCCATGGAT TTGGATTTGT CTATTGGGT 29 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

ACCAAACAAG AGAAGAAACT TGCTTGGTAA TATAAATTTA ACTTAAAATT AACTTAGGAT 60
TTAAGACATT GACTAGAAGG TCAAGAAAAG GGAACTCTAT AATTTCAAAA ATGTTGAGCC 120
TATTTGATAC ATTTAATGCA CGTAGGCAAG AAAACATAAC AAAATCAGCC GGTGGAGCTA 180
TCATTCCTGG ACAGAAAAAT ACTGTCTCTA TATTCGCCCT TGGACCGACA ATAACTGATG 240
ATAATGAGAA AATGACATTA GCTCTTCTAT TTCTATCTCA TTCACTAGAT AATGAGAAAC 300
AACATGCACA AAGGGCAGGG TTCTTGGTGT CTTTATTGTC AATGGCTTAT GCCAATCCAG 360
AGCTCTACCT AACAACAAAT GGAAGTAATG CAGATGCCAA GTATGTCATA TACATGATTG 420
AGAAAGATCT AAAACGGCAA AAGTATGGAG GATTTGTGGT TAAGACGAGA GAGATGATAT 480
ATGAAAAGAC AACTGATTGG ATATTTGGAA GTGACCTGGA TTATGATCAG GAAACTATGT 540
TGCAGAACGG CAGGAACAAT TCAACAATTG AAGACCTTGT CCACACATTT GGGTATCCAT 600
CATGTTTAGG AGCTCTTATA ATACAGATCT GGATAGTTCT GGTCAAAGCT ATCACTAGTA 660
TCTCAGGGTT AAGAAAAGGC TTTTTCACCC GATTGGAAGC TTTCAGACAA GATGGAACAG 720
TGCAGGCAGG GCTGGTATTG AGCGGTGACA CAGTGGATCA GATTGGGTCA ATCATGCGGT 780
CTCAACAGAG CTTGGTAACT CTTATGGTTG AAACATTAAT AACAATGAAT ACCAGCAGAA 840
ATGACCTCAC AACCATAGAA AAGAATATAC AAATTGTTGG CAACTACATA AGAGATGCAG 900
GTCTCGCTTC ATTCTTCAAT ACAATCAGAT ATGGAATTGA GACCAGAATG GCAGCTTTGA 960
CTCTATCCAC TCTCAGACCA GATATCAATA GATTAAAAGC TTTGATGGAA CTGTATTTAT 1020
CAAAGGGACC ACGCGCTCCT TTCATCTGTA TCCTCAGAGA TCCTATACAT GGTGAGTTCG 1080
CACCAGGCAA CTATCCTGCC ATATGGAGCT ATGCAATGGG GGTGGCAGTT GTACAAAATA 1140
GAGCCATGCA ACAGTATGTG ACGGGAAGAT CATATCTAGA CATTGATATG TTCCAGCTAG 1200
GACAAGCAGT AGCACGTGAT GCCGAAGCTC AAATGAGCTC AACACTGGAA GATGAACTTG 1260
GAGTGACACA CGAAGCTAAA GAAAGCTTGA AGAGACATAT AAGGAACATA AACAGTTCAG 1320
AGACATCTTT CCACAAACCG ACAGGTGGAT CAGCCATAGA GATGGCAATA GATGAAGAGC 1380
CAGAACAATT CGAACATAGA GCAGATCAAG AACAAAATGG AGAACCTCAA TCATCCATAA 1440
TTCAATATGC CTGGGCAGAA GGAAATAGAA GCGATGATCA GACTGAGCAA GCTACAGAAT 1500
CTGACAATAT CAAGACCGAA CAACAAAACA TCAGAGACAG ACTAAACAAG AGACTCAACG 1560
ACAAGAAGAA ACAAAGCAGT CAACCACCCA CTAATCCCAC AAACAGAACA AACCAGGACG 1620
AAATAGATGA TCTGTTTAAC GCATTTGGAA GCAACTAATC GAATCAACAT TTTAATCTAA 1680
ATCAATAATA AATAAGAAAA ACTTAGGATT AAAGAATCCT ATCATACCGG AATATAGGGT 1740
GGTAAATTTA GAGTCTGCTT GAAACTCAAT CAATAGAGAG TTGATGGAAA GCGATGCTAA 1800
AAACTATCAA ATCATGGATT CTTGGGAAGA GGAATCAAGA GATAAATCAA CTAATATCTC 1860
CTCGGCCCTC AACATCATTG AATTCATACT CAGCACCGAC CCCCAAGAAG ACTTATCGGA 1920
AAACGACACA ATCAACACAA GAACCCAGCA ACTCAGTGCC ACCATCTGTC AACCAGAAAT 1980
CAAACCAACA GAAACAAGTG AGAAAGATAG TGGATCAACT GACAAAAATA GACAGTCTGG 2040
GTCATCACAC GAATGTACAA CAGAAGCAAA AGATAGAAAC ATTGATCAGG AAACTGTACA 2100
GAGAGGACCT GGGAGAAGAA GCAGCTCAGA TAGTAGAGCT GAGACTGTGG TCTCTGGAGG 2160
AATCCCCAGA AGCATCACAG ATTCTAAAAA TGGAACCCAA AACACGGAGG ATATTGATCT 2220
CAATGAAATT AGAAAGATGG ATAAGGACTC TATTGAGGGG AAAATGCGAC AATCTGCAAA 2280
TGTTCCAAGC GAGATATCAG GAAGTGATGA CATATTTACA ACAGAACAAA GTAGAAACAG 2340
TGATCATGGA AGAAGCCTGG AATCTATCAG TACACCTGAT ACAAGATCAA TAAGTGTTGT 2400
TACTGCTGCA ACACCAGATG ATGAAGAAGA AATACTAATG AAAAATAGTA GGACAAAGAA 2460
AAGTTCTTCA ACACATCAAG AAGATGACAA AAGAATTAAA AAAGGGGGAA AAGGGAAAGA 2520
CTGGTTTAAG AAATCAAAAG ATACCGACAA CCAGATACCA ACATCAGACT ACAGATCCAC 2580
ATCAAAAGGG CAGAAGAAAA TCTCAAAGAC AACAACCACC AACACCGACA CAAAGGGGCA 2640
AACAGAAATA CAGACAGAAT CATCAGAAAC ACAATCCTCA TCATGGAATC TCATCATCGA 2700
CAACAACACC GACCGGAACG AACAGACAAG CACAACTCCT CCAACAACAA CTTCCAGATC 2760
AACTTATACA AAAGAATCGA TCCGAACAAA CTCTGAATCC AAACCCAAGA CACAAAAGAC 2820
AAATGGAAAG GAAAGGAAGG ATACAGAAGA GAGCAATCGA TTTACAGAGA GGGCAATTAC 2880
TCTATTGCAG AATCTTGGTG TAATTCAATC CACATCAAAA CTAGATTTAT ATCAAGACAA 2940
ACGAGTTGTA TGTGTAGCAA ATGTACTAAA CAATGTAGAT ACTGCATCAA AGATAGATTT 3000
CCTGGCAGGA TTAGTCATAG GGGTTTCAAT GGACAACGAC ACAAAATTAA CACAGATACA 3060
AAATGAAATG CTAAACCTCA AAGCAGATCT AAAGAAAATG GACGAATCAC ATAGAAGATT 3120
GATAGAAAAT CAAAGAGAAC AACTGTCATT GATCACGTCA CTAATTTCAA ATCTCAAAAT 3180
TATGACTGAG AGAGGAGGAA AGAAAGACCA AAATGAATCC AATGAGAGAG TATCCATGAT 3240
CAAAACAAAA TTGAAAGAAG AAAAGATCAA GAAGACCAGG TTTGACCCAC TTATGGAGGC 3300
ACAAGGCATT GACAAGAATA TACCCGATCT ATATCGACAT GCAGGAGATA CACTAGAGAA 3360
CGATGTACAA GTTAAATCAG AGATATTAAG TTCATACAAT GAGTCAAATG CAACAAGACT 3420
AATACCCAAA AAAGTGAGCA GTACAATGAG ATCACTAGTT GCAGTCATCA ACAACAGCAA 3480
TCTCTCACAA AGCACAAAAC AATCATACAT AAACGAACTC AAACGTTGCA AAAATGATGA 3540
AGAAGTATCT GAATTAATGG ACATGTTCAA TGAAGATGTC AACAATTGCC AATGATCCAA 3600
CAAAGAAACG ACACCGAACA AACAGACAAG AAACAACAGT AGATCAAAAC CTGTCAACAC 3660
ACACAAAATC AAGCAGAATG AAACAACAGA TATCAATCAA TATACAAATA AGAAAAACTT 3720
AGGATTAAAG AATAAATTAA TCCTTGTCCA AAATGAGTAT AACTAACTCT GCAATATACA 3780
CATTCCCAGA ATCATCATTC TCTGAAAATG GTCATATAGA ACCATTACCA CTCAAAGTCA 3840
ATGAACAGAG GAAAGCAGTA CCCCACATTA GAGTTGCCAA GATCGGAAAT CCACCAAAAC 3900
ACGGATCCCG GTATTTAGAT GTCTTCTTAC TCGGCTTCTT CGAGATGGAA CGAATCAAAG 3960
ACAAATACGG GAGTGTGAAT GATCTCGACA GTGACCCGAG TTACAAAGTT TGTGGCTCTG 4020
GATCATTACC AATCGGATTG GCTAAGTACA CTGGGAATGA CCAGGAATTG TTACAAGCCG 4080
CAACCAAACT GGATATAGAA GTGAGAAGAA CAGTCAAAGC GAAAGAGATG GTTGTTTACA 4140
CGGTACAAAA TATAAAACCA GAACTGTACC CATGGTCCAA TAGACTAAGA AAAGGAATGC 4200
TGTTCGATGC CAACAAAGTT GCTCTTGCTC CTCAATGTCT TCCACTAGAT AGGAGCATAA 4260
AATTTAGAGT AATCTTCGTG AATTGTACGG CAATTGGATC AATAACCTTG TTCAAAATTC 4320
CTAAGTCAAT GGCATCACTA TCGTTAACCA ACACAATATC AATCAATCTG CAGGTACACA 4380
TAAAAACAGG GGTTCAGACT GATTCTAAAG GGATAGTTCA AATTTTGGAT GAGAAAGGCG 4440
AAAAATCACT GAATTTCATG GTCCATCTCG GATTGATCAA AAGAAAAGTA GGCAGAATGT 4500
ACTCTGTTGA ATACTGTAAA CAGAAAATCG AGAAAATGAG ATTGATATTT TCTTTAGGAC 4560
TAGTTGGAGG AATCAGTCTT CATGTCAATG CAACTGGGTC CATATCAAAA ACACTAGCAA 4620
GTCAGCTGGT ATTCAAAAGA GAGATTTGTT ATCCTTTAAT GGATCTAAAT CCGCATCTCA 4680
ATCTAGTTAT CTGGGCTTCA TCAGTAGAGA TTACAAGAGT GGATGCAATT TTCCAACCTT 4740
CTTTACCTGG CGAGTTCAGA TACTATCCTA ATATTATTGC AAAAGGAGTT GGGAAAATCA 4800
AACAATGGAA CTAGTAATCT CTATTTTAGT CCGGACGTAT CTATTAAGCC GAAGCAAATA 4860
AAGGATAATC AAAAACTTAG GACAAAAGAG GTCAATACCA ACAACTATTA GCAGTCACAC 4920
TCGCAAGAAT AAGAGAGAAG GGACCAAAAA AGTCAAATAG GAGAAATCAA AACAAAAGGT 4980
ACAGAACACC AGAACAACAA AATCAAAACA TCCAACTCAC TCAAAACAAA AATTCCAAAA 5040
GAGACCGGCA ACACAACAAG CACTGAACAC AATGCCAACT TCAATACTGC TAATTATTAC 5100
AACCATGATC ATGGCATCTT TCTGCCAAAT AGATATCACA AAACTACAGC ACGTAGGTGT 5160
ATTGGTCAAC AGTCCCAAAG GGATGAAGAT ATCACAAAAC TTTGAAACAA GATATCTAAT 5220
TTTGAGCCTC ATACCAAAAA TAGAAGACTC TAACTCTTGT GGTGACCAAC AGATCAAGCA 5280
ATACAAGAAG TTATTGGATA GACTGATCAT CCCTTTATAT GATGGATTAA GATTACAGAA 5340
AGATGTGATA GTAACCAATC AAGAATCCAA TGAAAACACT GATCCCAGAA CAAAACGATT 5400
CTTTGGAGGG GTAATTGGAA CCATTGCTCT GGGAGTAGCA ACCTCAGCAC AAATTACAGC 5460
GGCAGTTGCT CTGGTTGAAG CCAAGCAGGC AAGATCAGAC ATCGAAAAAC TCAAAGAAGC 5520
AATTAGGGAC ACAAACAAAG CAGTGCAGTC AGTTCAGAGC TCCATAGGAA ATTTAATAGT 5580
AGCAATTAAA TCAGTCCAGG ATTATGTTAA CAAAGAAATC GTGCCATCGA TTGCGAGGCT 5640
AGGTTGTGAA GCAGCAGGAC TTCAATTAGG AATTGCATTA ACACAGCATT ACTCAGAATT 5700
AACAAACATA TTTGGTGATA ACATAGGATC GTTACAAGAA AAAGGAATAA AATTACAAGG 5760
TATAGCATCA TTATACCGCA CAAATATCAC AGAAATATTC ACAACATCAA CAGTTGATAA 5820
ATATGATATC TATGATCTGT TATTTACAGA ATCAATAAAG GTGAGAGTTA TAGATGTTGA 5880
CTTGAATGAT TACTCAATCA CCCTCCAAGT CAGACTCCCT TTATTAACTA GGCTGCTGAA 5940
CACTCAGATC TACAAAGTAG ATTCCATATC ATATAACATC CAAAACAGAG AATGGTATAT 6000
CCCTCTTCCC AGCCATATCA TGACGAAAGG GGCATTTCTA GGTGGAGCAG ACGTCAAAGA 6060
ATGTATAGAA GCATTCAGCA GCTATATATG CCCTTCTGAT CCAGGATTTG TATTAAACCA 6120
TGAAATAGAG AGCTGCTTAT CAGGAAACAT ATCCCAATGT CCAAGAACAA CGGTCACATC 6180
AGACATTGTT CCAAGATATG CATTTGTCAA TGGAGGAGTG GTTGCAAACT GTATAACAAC 6240
CACCTGTACA TGCAACGGAA TTGGTAATAG AATCAATCAA CCACCTGATC AAGGAGTAAA 6300
AATTATAACA CATAAAGAAT GTAGTACTGT GGGTATCAAC GGAATGCTGT TCAATACAAA 6360
TAAAGAAGGA ACTCTTGCAT TCTATACACC AAATGATATA ACACTAAACA ATTCTGTTAC 6420
ACTGGATCCA ATTGACATAT CAATCGAGCT CAACAAGGCC AAATCAGATC TAGAAGAATC 6480
AAAAGAATGG ATAAGAAGGT CAAATCAAAA ACTAGATTCT ATTGGAAATT GGCATCAATC 6540
TAGCACTACA ATCATAATTA TTTTGATAAT GATCATTATA TTGTTTATAA TTAATATAAC 6600
GATAATTACA ATTGCAATTA AGTATTACAG AATTCAAAAG AGAAATCGAG TGGATCAAAA 6660
TGACAAGCCA TATGTACTAA CAAACAAATA ACATATCTAC AGATCATTAG ATATTAAAAT 6720
TATAAAAAAC TTAGGAGTAA AGTTACGCAA TCCAACTCTA CTCATATAAT TGAGGAAGGA 6780
CCCAATAGAC AAATCCAAAT TCGAGATGGA ATACTGGAAG CATACCAATC ACGGAAAGGA 6840
TGCTGGTAAT GAGCTGGAGA CGTCTATGGC TACTCATGGC AACAAGCTCA CTAATAAGAT 6900
AATATACATA TTATGGACAA TAATCCTGGT GTTATTATCA ATAGTCTTCA TCATAGTGCT 6960
AATTAATTCC ATCAAAAGTG AAAAGGCCCA CGAATCATTG CTGCAAGACA TAAATAATGA 7020
GTTTATGGAA ATTACAGAAA AGATCCAAAT GGCATCGGAT AATACCAATG ATCTAATACA 7080
GTCAGGAGTG AATACAAGGC TTCTTACAAT TCAGAGTCAT GTCCAGAATT ACATACCAAT 7140
ATCATTGACA CAACAGATGT CAGATCTTAG GAAATTCATT AGTGAAATTA CAATTAGAAA 7200
TGATAATCAA GAAGTGCTGC CACAAAGAAT AACACATGAT GTAGGTATAA AACCTTTAAA 7260
TCCAGATGAT TTTTGGAGAT GCACGTCTGG TCTTCCATCT TTAATGAAAA CTCCAAAAAT 7320
AAGGTTAATG CCAGGGCCGG GATTATTAGC TATGCCAACG ACTGTTGATG GCTGTGTTAG 7380
AACTCCGTCT TTAGTTATAA ATGATCTGAT TTATGCTTAT ACCTCAAATC TAATTACTCG 7440
AGGTTGTCAG GATATAGGAA AATCATATCA AGTCTTACAG ATAGGGATAA TAACTGTAAA 7500
CTCAGACTTG GTACCTGACT TAAATCCTAG GATCTCTCAT ACCTTTAACA TAAATGACAA 7560
TAGGAAGTCA TGTTCTCTAG CACTCCTAAA TATAGATGTA TATCAACTGT GTTCAACTCC 7620
CAAAGTTGAT GAAAGATCAG ATTATGCATC ATCAGGCATA GAAGATATTG TACTTGATAT 7680
TGTCAATTAT GATGGTTCAA TCTCAACAAC AAGATTTAAG AATAATAACA TAAGCTTTGA 7740
TCAACCATAT GCTGCACTAT ACCCATCTGT TGGACCAGGG ATATACTACA AAGGCAAAAT 7800
AATATTTCTC GGGTATGGAG GTCTTGAACA TCCAATAAAT GAGAATGTAA TCTGCAACAC 7860
AACTGGGTGC CCCGGGAAAA CACAGAGAGA CTGTAATCAA GCATCTCATA GTACTTGGTT 7920
TTCAGATAGG AGGATGGTCA ACTCCATCAT TGTGGCTGAC AAAGGCTTAA ACTCAATTCC 7980
AAAATTGAAA GTATGGACGA TATCTATGCG ACAAAATTAC TGGGGGTCAG AAGGAAGGTT 8040
ACTTCTACTA GGTAACAAGA TCTATATATA TACAAGATCT ACAAGTTGGC ATAGCAAGTT 8100
ACAATTAGGA ATAATTGATA TTACTGATTA CAGTGATATA AGGATAAAAT GGACATGGCA 8160
TAATGTGCTA TCAAGACCAG GAAACAATGA ATGTCCATGG GGACATTCAT GTCCAGATGG 8220
ATGTATAACA GGAGTATATA CTGATGCATA TCCACTCAAT CCCACAGGGA GCATTGTGTC 8280
ATCTGTCATA TTAGACTCAC AAAAATCGAG AGTGAACCCA GTCATAACTT ACTCAACAGC 8340
AACCGAAAGA GTAAACGAGC TGGCCATCCT AAACAGAACA CTCTCAGCTG GATATACAAC 8400
AACAAGCTGC ATTACACACT ATAACAAAGG ATATTGTTTT CATATAGTAG AAATAAATCA 8460
TAAAAGCTTA AACACATTTC AACCCATGTT GTTCAAAACA GAGATTCCAA AAAGCTGCAG 8520
TTAATCATAA TTAACCATAA TATGCATCAA TCTATCTATA ATACAAGTAT ATGATAAGTA 8580
ATCAGCAATC AGACAATAGA CAAAAGGGAA ATATAAAAAA CTTAGGAGCA AAGCGTGCTC 8640
GGGAAATGGA CACTGAATCT AACAATGGCA CTGTATCTGA CATACTCTAT CCTGAGTGTC 8700
ACCTTAACTC TCCTATCGTT AAAGGTAAAA TAGCACAATT ACACACTATT ATGAGTCTAC 8760
CTCAGCCTTA TGATATGGAT GACGACTCAA TACTAGTTAT CACTAGACAG AAAATAAAAC 8820
TTAATAAATT GGATAAAAGA CAACGATCTA TTAGAAGATT AAAATTAATA TTAACTGAAA 8880
AAGTGAATGA CTTAGGAAAA TACACATTTA TCAGATATCC AGAAATGTCA AAAGAAATGT 8940
TCAAATTATA TATACCTGGT ATTAACAGTA AAGTGACTGA ATTATTACTT AAAGCAGATA 9000
GAACATATAG TCAAATGACT GATGGATTAA GAGATCTATG GATTAATGTG CTATCAAAAT 9060
TAGCCTCAAA AAATGATGGA AGCAATTATG ATCTTAATGA AGAAATTAAT AATATATCGA 9120
AAGTTCACAC AACCTATAAA TCAGATAAAT GGTATAATCC ATTCAAAACA TGGTTTACTA 9180
TCAAGTATGA TATGAGAAGA TTACAAAAAG CTCGAAATGA GATCACTTTT AATGTTGGGA 9240
AGGATTATAA CTTGTTAGAA GACCAGAAGA ATTTCTTATT GATACATCCA GAATTGGTTT 9300
TGATATTAGA TAAACAAAAC TATAATGGTT ATCTAATTAC TCCTGAATTA GTATTGATGT 9360
ATTGTGACGT AGTCGAAGGC CGATGGAATA TAAGTGCATG TGCTAAGTTA GATCCAAAAT 9420
TACAATCTAT GTATCAGAAA GGTAATAACC TGTGGGAAGT GATAGATAAA TTGTTTCCAA 9480
TTATGGGAGA AAAGACATTT GATGTGATAT CGTTATTAGA ACCACTTGCA TTATCCTTAA 9540
TTCAAACTCA TGATCCTGTT AAACAACTAA GAGGAGCTTT TTTAAATCAT GTGTTATCCG 9600
AGATGGAATT AATATTTGAA TCTAGAGAAT CGATTAAGGA ATTTCTGAGT GTAGATTACA 9660
TTGATAAAAT TTTAGATATA TTTAATAAGT CTACAATAGA TGAAATAGCA GAGATTTTCT 9720
CTTTTTTTAG AACATTTGGG CATCCTCCAT TAGAAGCTAG TATTGCAGCA GAAAAGGTTA 9780
GAAAATATAT GTATATTGGA AAACAATTAA AATTTGACAC TATTAATAAA TGTCATGCTA 9840
TCTTCTGTAC AATAATAATT AACGGATATA GAGAGAGGCA TGGTGGACAG TGGCCTCCTG 9900
TGACATTACC TGATCATGCA CACGAATTCA TCATAAATGC TTACGGTTCA AACTCTGCGA 9960
TATCATATGA AAATGCTGTT GATTATTACC AGAGCTTTAT AGGAATAAAA TTCAATAAAT 10020
TCATAGAGCC TCAGTTAGAT GAGGATTTGA CAATTTATAT GAAAGATAAA GCATTATCTC 10080
CAAAAAAATC AAATTGGGAC ACAGTTTATC CTGCATCTAA TTTACTGTAC CGTACTAACG 10140
CATCCAACGA ATCACGAAGA TTAGTTGAAG TATTTATAGC AGATAGTAAA TTTGATCCTC 10200
ATCAGATATT GGATTATGTA GAATCTGGGG ACTGGTTAGA TGATCCAGAA TTTAATATTT 10260
CTTATAGTCT TAAAGAAAAA GAGATCAAAC AGGAAGGTAG ACTCTTTGCA AAAATGACAT 10320
ACAAAATGAG AGCTACACAA GTTTTATCAG AGACCCTACT TGCAAATAAC ATAGGAAAAT 10380
TCTTTCAAGA AAATGGGATG GTGAAGGGAG AGATTGAATT ACTTAAGAGA TTAACAACCA 10440
TATCAATATC AGGAGTTCCA CGGTATAATG AAGTGTACAA TAATTCTAAA AGCCATACAG 10500
ATGACCTTAA AACCTACAAT AAAATAAGTA ATCTTAATTT GTCTTCTAAT CAGAAATCAA 10560
AGAAATTTGA ATTCAAGTCA ACGGATATCT ACAATGATGG ATACGAGACT GTGAGCTGTT 10620
TCCTAACAAC AGATCTCAAA AAATACTGTC TTAATTGGAG ATATGAATCA ACAGCTCTAT 10680
TTGGAGAAAC TTGCAACCAA ATATTTGGAT TAAATAAATT GTTTAATTGG TTACACCCTC 10740
GTCTTGAAGG AAGTACAATC TATGTAGGTG ATCCTTACTG TCCTCCATCA GATAAAGAAC 10800
ATATATCATT
TAGAAGGATT 
CTGTTAGAAT 
TAACCACAAG 
ATGTAGTGAG 
AATTAAATGA 
ATGGGAGAAT 
CAGTAATAGA 
TTGAGAATGG 
AACTATATAT 
AGTATTTTAG 
GATTCAATCA 
CCGCATTGGC 
ATAGGATTAT 
ATTCATGCAA 
GGAATGTATT 
TAGAAGAAGA 
TTGCACATGA 
TAGATACGAC 
TGTTGAGGAA 
GACTAATTGT 
CATTGCGACA 
CGCCTGACCC 
TATGTTATTC 
AAATAGGATC 
CTGATGAAAG 
CCGCAATAAG 
TGGAAGCCTC 
TAACAC CGGT 
TGAAATTCTC 
ACATGTCTAT 
TGTTAACAGG 
ACC CTATAGT 
ATGAACATAT 
TTATTTATGA 


AGAGGATCAC 
TTGTCAAAAA 
AGGCGTGAGG 
AGTACCCAAC 
ATTTTTTGAT 
AACGATTATA 
TCTTCCTCAA 
CGAAACAAGA 
TTATTCACCT 
TGCCCTT GGG 
GAATCCAAAT 
CATGGCGATG 
TGATATTAAA 
GAATCAAGAA 
TTTACCACAA 
ACAAGATTCA 
TGAAGAATTA 
TATTCTAGAT 
AAAATCACTA 
AATCAGTAAT 
AAGTGATAAA 
AAAGATGTGG 
ATTAGAATTA 
TTCAGATGGC 
AGCAGAAACA 
ATCTGAAGCA 
AATAGCAATG 
ACAGATAGCA 
AGCTACATCA 
CAGTACATCA 
CAAAGAAGCT 
ATTAAGTGTT 
TATGCATCTG 
TAATCCAGAG 
TAAAGACCCA 


CCTGATTCTG 

TTATGGACAC 

GTGACTGCAA 

AATTATGACT 

TCATTAAGAG 

AGTAGCAAGA 

GCTCTAAAAG 

TCAGCATCTT 

GTTCTAGGAT 

ATGAATATCA 

TGGATGCAAT 

TCAAGATGTT 

AGATTTATTA 

CCAGGTGAGT 

TCTCAAAATA 

CCAAATCCAT 

GCTGAGTTCC 

AATTCTCTCA 

ATTCGGGTTG 

TACGATCTAG 

ATCAAGTATG 

ATTCATTTAT 

CTATCTGGGG 

ACAAACCCAT 

GGTATATCGT 

CAATTAGGAT 

ATATATACAT 

CAAACACGTG 

ACAAATTTAT 

TTGATCAGAG 

AATGAAACCA 

TTCGAATATT 

CACATAGAAG 

TCTACATTAG 

CTCAAAGATG 


GTTTTTACGT 
TCATATCTAT 
TGGTTCAAGG 
ACAGAGTTAA 
AAGTGATGGA 
TGTTCATATA 
CATTATC TAG 
CAAATTTGGC 
ATGCATGCTC 
ATC CAACTAT 
ATGCCTCTTT 
TTGTAAGGAA 
AGGCGAATCT 
CATCTTTTTT 
TAACCACCAT 
TATTATCTGG 
TGATGGACAG 
CAGGAATTAG 
GCATAAATAG 
TACAATATGA 
AAGATATGTG 
CAGGAGGAAG 
TAGTAATAAC 
ATACTTGGAT 
CATTAAGAGT 
ATATCAAGAA 
GGGCATTTGG 
CAAATTTTAC 
CACACAGATT 
TCAGCAGATT 
AAGATACTAA 
TATTTAGATT 
ATGAGTGTTG 
AATTAATTCG 
TGGACTTATC 


TCATAACCCA 
AAGTGCAATA 
AGACAATCAA 
GAAGGAGATA 
TGATCTAGGT 
TAGCAAAAGA 
ATGTGTCTTC 
AACATCATTT 
AATTTTTAAG 
AACACAGAAT 
AATACCTGCT 
TATTGGTGAT 
ATTAGACCGA 
TGATTGGGCT 
GATAAAAAAT 
ATTATTCACA 
GAAGGTAATT 
AAATGCCATA 
AGGAGGACTG 
AACACTAAGT 
TTCGGTAGAC 
GATGATAAGT 
AGGATCAGAA 
GTATTTAC CC 
TCCTTATTTT 
TCTTAGTAAA 
TAATGATGAG 
ACTAGATAGT 
AAAGGATACT 
CATAACAATG 
TCTTATTTAT 
AAAAGAAACC 
TATTAAAGAA 
ATATCCTGAA 
AAAACTTATG 


AGAGGGGGTA 

CATCTAGCAG 

GCTATAGCTG 

GTTTATAAAG 

CATGAACTTA 

ATCTATTATG 

TGGTCAGAGA 

GCAAAAGCAA 

AATATTCAAC 

ATCAGAGATC 

AGTGTTGGGG 

CCATCAGTTG 

AGTGTTCTTT 

TCAGATCCAT 

ATAACAGCAA 

AATACAATGA 

CTCCCTAGAG 

GCTGGAATGT 

ACATATAGTT 

AGGACTTTGC 

CTTGCCATAG 

GGACTTGAAA 

CATTGTAAAA 

GGTAATATCA 

GGATCAGTCA 

CCTGCAAAAG 

ATATCTTGGA 

CTCAAAATTT 

GCAACTCAGA 

TCCAATGATA 

CAACAAATAA 

ACAGGACACA 

AGTTTTAATG 

AGTAATGAAT 

GTTATTAAAG 


10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 


12900 







ACCATTCTTA CACAATTGAT ATGAATTATT GGGATGATAC TGACATCATA CATGCAATTT 12960
CAATATGTAC TGCAATTACA ATAGCAGATA CTATGTCACA ATTAGATCGA GATAATTTAA 13020
AAGAGATAAT AGTTATTGCA AATGATGATG ATATTAATAG CTTAATCACT GAATTTTTGA 13080
CTCTTGACAT ACTTGTATTT CTCAAGACAT TTGGTGGATT ATTAGTAAAT CAATTTGCAT 13140
ACACTCTTTA TAGTCTAAAA ATAGAAGGTA GGGATCTCAT TTGGGATTAT ATAATGAGAA 13200
CACTGAGAGA TACTTCCCAT TCAATATTAA AAGTATTATC TAATGCATTA TCTCATCCTA 13260
AAGTATTCAA GAGGTTCTGG GATTGTGGAG TTTTAAACCC TATTTATGGG CCTAATATCG 13320
CTAGTCAAGA CCAGATAAAA CTTGCCCTAT CTATATGTGA ATATTCACTA GATCTATTTA 13380
TGAGAGAATG GTTGAATGGT GTATCACTTG AAATATACAT TTGTGACAGC GATATGGAAG 13440
TTGCAAATGA TAGGAAACAA GCCTTTATTT CTAGACACCT TTCATTTGTT TGTTGTTTAG 13500
CAGAAATTGC ATCTTTCGGA CCTAACCTGT TAAACTTAAC ATACTTGGAG AGACTTGATC 13560
TATTGAAACA ATATCTTGAA TTAAATATTA AAGAAGACCC TACTCTTAAA TATGTACAAA 13620
TATCTGGATT ATTAATTAAA TCGTTCCCAT CAACTGTAAC ATACGTAAGA AAGACTGCAA 13680
TCAAATATCT AAGGATTCGC GGTATTAGTC CACCTGAGGT AATTGATGAT TGGGATCCGG 13740
TAGAAGATGA AAATATGCTG GATAACATTG TCAAAACTAT AAATGATAAC TGTAATAAAG 13800
ATAATAAAGG GAATAAAATT AACAATTTCT GGGGACTAGC ACTTAAGAAC TATCAAGTCC 13860
TTAAAATCAG ATCTATAACA AGTGATTCTG ATGATAATGA TAGACTAGAT GCTAATACAA 13920
GTGGTTTGAC ACTTCCTCAA GGAGGGAATT ATCTATCGCA TCAATTGAGA TTATTCGGAA 13980
TCAACAGCAC TAGTTGTCTG AAAGCTCTTG AGTTATCACA AATTTTAATG AAGGAAGTCA 14040
ATAAAGACAA GGACAGGCTC TTCCTGGGAG AAGGAGCAGG AGCTATGCTA GCATGTTATG 14100
ATGCCACATT AGGACCTGCA GTTAATTATT ATAATTCAGG TTTGAATATA ACAGATGTAA 14160
TTGGTCAACG AGAATTGAAA ATATTTCCTT CAGAGGTATC ATTAGTAGGT AAAAAATTAG 14220
GAAATGTGAC ACAGATTCTT AACAGGGTAA AAGTACTGTT CAATGGGAAT CCTAATTCAA 14280
CATGGATAGG AAATATGGAA TGTGAGAGCT TAATATGGAG TGAATTAAAT GATAAGTCCA 14340
TTGGATTAGT ACATTGTGAT ATGGAAGGAG CTATCGGTAA ATCAGAAGAA ACTGTTCTAC 14400
ATGAACATTA TAGTGTTATA AGAATTACAT ACTTGATTGG GGATGATGAT GTTGTTTTAG 14460
TTTCCAAAAT TATACCTACA ATCACTCCGA ATTGGTCTAG AATACTTTAT CTATATAAAT 14520
TATATTGGAA AGATGTAAGT ATAATATCAC TCAAAACTTC TAATCCTGCA TCAACAGAAT 14580
TATATCTAAT TTCGAAAGAT GCATATTGTA CTATAATGGA ACCTAGTGAA ATTGTTTTAT 14640
CAAAACTTAA AAGATTGTCA CTCTTGGAAG AAAATAATCT ATTAAAATGG ATCATTTTAT 14700
CAAAGAAGAG GAATAATGAA TGGTTACATC ATGAAATCAA AGAAGGAGAA AGAGATTATG 14760
GAATCATGAG ACCATATCAT ATGGCACTAC AAATCTTTGG ATTTCAAATC AATTTAAATC 14820
ATCTGGCGAA AGAATTTTTA TCAACCCCAG ATCTGACTAA TATCAACAAT ATAATCCAAA 14880
GTTTTCAGCG AACAATAAAG GATGTTTTAT TTGAATGGAT TAATATAACT CATGATGATA 14940
AGAGACATAA ATTAGGCGGA AGATATAACA TATTCCCACT GAAAAATAAG GGAAAGTTAA 15000





GACTGCTATC GAGAAGACTA GTATTAAGTT GGATTTCATT ATCATTATCG ACTCGATTAC 15060
TTACAGGTCG CTTTCCTGAT GAAAAATTTG AACATAGAGC ACAGACTGGA TATGTATCAT 15120
TAGCTGATAC TGATTTAGAA TCATTAAAGT TATTGTCGAA AAACATCATT AAGAATTACA 15180
GAGAGTGTAT AGGATCAATA TCATATTGGT TTCTAACCAA AGAAGTTAAA ATACTTATGA 15240
AATTGATCGG TGGTGCTAAA TTATTAGGAA TTCCCAGACA ATATAAAGAA CCCGAAGACC 15300
AGTTATTAGA AAACTACAAT CAACATGATG AATTTGATAT CGATTAAAAC ATAAATACAA 15360
TGAAGATATA TCCTAACCTT TATCTTTAAG CCTAGGAATA GACAAAAAGT AAGAAAAACA 15420
TGTAATATAT ATATACCAAA CAGAGTTCTT CTCTTGTTTG GT 15462


(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

TTGTCTGGGA AT 12 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

TTGCCTGGGA AT 12 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

TTGTTTGGGA AT 12

(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 
TTGTCTGGTA AT 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50
AACTTTAAAT TA 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
AACTTAAAAT TA 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
TTAAAGACAT TG 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 


TTTAAGACAT TG 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
GCAGATGTCA AG 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
GCAGATGCCA AG 

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 
CGAATCTAAA GA 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 


CGAAGCTAAA GA 



(2) INFORMATION FOR SEQ ID NO: 58: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 
GAAATATTGA TC 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 
GAAACATTGA TC 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 
TCTCTACCCA AC 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
TCGTTAACCA AC 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 
AGTACAATAG GT 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63 
AGTACTGTGG GT 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 
GCACTTGATC CA 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 
ACACTGGATC CA 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66
CCATCATTGT TGTTGACAA

(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 
CCATCATTGT GGCTGACAA

(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 
TTACATGGCC A 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 
TCACATGGCG A 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 


TTTGGACTGG GC 



(2) INFORMATION FOR SEQ ID NO: 71: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 
TTTTGATTGG GC 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 
GGTCCTAATA CT 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 
GGGCCTAATA TC 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 
CCATAGAGAG TCCATGGAAA GCGACGCTAA AAACTATC



